计算机科学
人工智能
深度学习
计算机视觉
机器学习
作者
YoungSeok Jeon,Kensuke Yoshino,Shigeo Hagiwara,Atsuya Watanabe,Swee Tian Quek,Hiroshi Yoshioka,Mengling Feng
标识
DOI:10.1109/jbhi.2021.3081355
摘要
We propose an interpretable and lightweight 3D deep neural network model that diagnoses anterior cruciate ligament (ACL) tears from a knee MRI exam. Previous works focused primarily on achieving better diagnostic accuracy but paid less attention to practical aspects such as explainability and model size. They mainly relied on ImageNet pre-trained 2D deep neural network backbones, such as AlexNet or ResNet, which are computationally expensive. Some of them tried to interpret the models using post-inference visualization tools, such as CAM or Grad-CAM, which lack in generating accurate heatmaps. Our work addresses the two limitations by understanding the characteristics of ACL tear diagnosis. We argue that the semantic features required for classifying ACL tears are locally confined and highly homogeneous. We harness the unique characteristics of the task by incorporating: 1) attention modules and Gaussian positional encoding to reinforce the seeking of local features; 2) squeeze modules and fewer convolutional filters to reflect the homogeneity of the features. As a result, our model is interpretable: our attention modules can precisely highlight the ACL region without any location information given to them. Our model is extremely lightweight: consisting of only 43 K trainable parameters and 7.1 G of Floating-point operations per second (FLOPs), that is 225 times smaller and 91 times lesser than the previous state-of-the-art, respectively. Our model is accurate: our model outperforms the previous state-of-the-art with the average ROC-AUC of 0.983 and 0.980 on the Chiba and Stanford knee datasets, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI