计算机科学
人工智能
预处理器
一般化
机器学习
杠杆(统计)
模式识别(心理学)
多任务学习
对比度(视觉)
分类器(UML)
任务(项目管理)
数学
管理
经济
数学分析
作者
G. G. Chen,Chih‐Ting Lin
标识
DOI:10.1016/j.asoc.2024.111478
摘要
Global pandemics have posed great challenges, such as limited samples and the scarcity of carefully curated datasets, in creating reliable models for chest X-ray (CXR) diagnosis. A common approach is to leverage pre-trained deep learning models using large datasets and then fine-tune model parameters on the target CXR dataset. However, in the presence of data bias, it is prone to shortcut learning, where the model learns non-disease-related features. To enable the model to learn robust features from the target dataset, we propose a new training framework called Multi-task Supervised Contrastive Learning (MTSCL). In MTSCL, we concatenate the weighted representations of two supervised tasks and implicitly contrast the representations from the large-scale and target datasets. It helps the model focus on disease-related features in the target dataset. This paper applies the MTSCL model to a two-stage hierarchical classification framework. In the first stage, the MTSCL model detects lung abnormalities, and then tree-based models are applied to classify the abnormalities as pneumonia or COVID-19 in the second stage. The experimental results show that, in the first stage, the MTSCL model outperforms other training methods in terms of generalization with an average AUC improvement of 1.03%-5.33% and has better abnormality localization capability. In the second stage, the model generalization is enhanced by incorporating lung masks and using MTSCL model-generated lesion masks for image preprocessing. Furthermore, the addition of wavelet components manipulation on top of the previous improvements further increases the generalization performance for distinguishing COVID-19 from typical pneumonia with an average AUC improvement of 12.38%. Overall, the two-stage hierarchical classification framework exhibits superior generalization ability for normal, pneumonia, and COVID-19 classification, achieving an average accuracy of 75.19% in two external datasets, outperforming state-of-the-art deep learning models (71.00%-74.08%) and classical machine learning models (67.80%-68.67%).
科研通智能强力驱动
Strongly Powered by AbleSci AI