计算机科学
稳健性(进化)
人工智能
深层神经网络
一致性(知识库)
人工神经网络
深度学习
模式识别(心理学)
编码(集合论)
财产(哲学)
机器学习
生物化学
化学
哲学
集合(抽象数据类型)
认识论
基因
程序设计语言
作者
Seungmin Oh,Namkug Kim,Jongbin Ryu
标识
DOI:10.1038/s41598-024-58382-3
摘要
Abstract In this paper, we introduce in-depth the analysis of CNNs and ViT architectures in medical images, with the goal of providing insights into subsequent research direction. In particular, the origins of deep neural networks should be explainable for medical images, but there has been a paucity of studies on such explainability in the aspect of deep neural network architectures. Therefore, we investigate the origin of model performance, which is the clue to explaining deep neural networks, focusing on the two most relevant architectures, such as CNNs and ViT. We give four analyses, including (1) robustness in a noisy environment, (2) consistency in translation invariance property, (3) visual recognition with obstructed images, and (4) acquired features from shape or texture so that we compare origins of CNNs and ViT that cause the differences of visual recognition performance. Furthermore, the discrepancies between medical and generic images are explored regarding such analyses. We discover that medical images, unlike generic ones, exhibit class-sensitive. Finally, we propose a straightforward ensemble method based on our analyses, demonstrating that our findings can help build follow-up studies. Our analysis code will be publicly available.
科研通智能强力驱动
Strongly Powered by AbleSci AI