计算机科学
人工智能
利用
编码(社会科学)
机器学习
数据挖掘
源代码
块(置换群论)
模式识别(心理学)
统计
几何学
计算机安全
数学
操作系统
作者
Yuzhou Wu,Zhigang Chen,Xin Yao,Xuechen Chen,Zeren Zhou,Jian Xue
标识
DOI:10.1109/jbhi.2022.3189404
摘要
The International Classification of Diseases (ICD) code is a disease classification method formulated by the World Health Organization(WHO). ICD coding usually requires clinicians to manually allocate ICD codes to clinical documents, which is labor-intensive, expensive, and error-prone. Therefore, many methods have been introduced for automatic ICD coding. However, most of the methods have ignored or cannot combine two essential features well: long-tailed label distribution and label correlation. In this paper, we propose a novel end-to-end Joint Attention Network (JAN) to solve these two problems. JAN includes Document-based attention and Label-based attention to capture semantic information from clinical document text and label description, respectively, which helps solve the classification of dense and sparse data in long-tailed label distribution. Besides, an Adaptive fusion layer and CorNet block are presented to adaptively adjust the weight of these two attentions and exploit label co-occurrence relations, respectively. Experiments on the MIMIC-III and MIMIC-II datasets demonstrate that our proposed JAN outperformed previous state-of-art methods achieving Micro-F1 of 0.553, Micro-AUC of 0.989 and precision at top 8(P@8) of 0.735. Finally, we also provide attention and label correlation visualization to verify the effectiveness of our model and improve the interpretation of our deep learning-based method.
科研通智能强力驱动
Strongly Powered by AbleSci AI