文字2vec
人工智能
计算机科学
模式识别(心理学)
特征(语言学)
背景(考古学)
特征学习
词(群论)
自然语言处理
图形
特征提取
数学
理论计算机科学
语言学
几何学
嵌入
生物
古生物学
哲学
作者
Boting Liu,Weili Guan,Zhijie Fang
摘要
Syndrome differentiation (SD) is a basic task in TCM (Traditional Chinese Medicine) diagnosis and treatment. TCM syndrome differentiation is very complex and time-consuming. Meanwhile, the accuracy of the results depends on the experience of TCM practitioners. To help TCM practitioners differentiate syndrome more quickly, we propose a syndrome differentiation method of deep learning based on multi-feature fusion. We extracted char, word and POS (Part of Speech) from TCM diagnosis and treatment records. The vector representation of char feature is obtained by ZY-BERT (Zhong Yi BERT), ZY-BERT was pre-trained on large datasets of TCM-SD (TCM Syndrome Differentiation). The vector representation of word and POS is obtained by Word2vec (Word to vector). We construct text graphs of char, word and POS according to context. GCN (Graph Convolutional Networks) is used to extract spatial structure information between multiple features to achieve multi-feature fusion. The experiment was carried out on TCM-SD. The experimental results showed that the accuracy of the proposed method was 81.52%, which was better than the comparison method. This method is helpful in the development of TCM modernization.
科研通智能强力驱动
Strongly Powered by AbleSci AI