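Morpheme
Natural language processing
Computer science
Artificial intelligence
Semantic change
Representation (politics)
Semantics (computer science)
Linguistics
Process (computing)
Statistical
Mathematics
Philosophy
Operating system
Statistics
Programming language
Law
Politics
Political science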
Authors
Yang Chi,Fausto Giunchiglia,Hao Xu
Source
Journal: Electronics
[Multidisciplinary Digital Publishing Institute]
Date: 2024-04-30
Volume (Issue): 13 (9): 1728
Identifier
DOI:10.3390/electronics13091728
Abstract
Lexical semantic changes spanning centuries can reveal the complex development of language and social culture. In recent years, natural language processing (NLP) methods have been applied in this field to track diachronic changes in word-sense frequency across large-scale historical corpora, for instance, analyzing which senses appear, increase, or decrease at which times. However, the field still lacks Chinese diachronic corpora and datasets that support supervised learning and text mining, and at the method level, few existing works analyze Chinese semantic change at the level of the morpheme. This paper constructs a diachronic Chinese dataset spanning 3000 years for semantic tracking applications and extends the existing framework to the level of Chinese characters and morphemes. The extended framework consists of four main steps: contextual sense representation, sense identification, morpheme sense mining, and diachronic semantic change representation. Experiments show the effectiveness of our method at each step. Finally, we find a strong positive correlation, in both frequency and change trend, between monosyllabic word senses and their corresponding morphemes.
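The final statistic in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the period labels, sense labels, and counts are made-up toy data, the function names are hypothetical, and Pearson correlation is assumed here because the abstract does not name the coefficient it uses. The sketch computes the relative frequency of each sense per period, then correlates the trend of a monosyllabic word sense with the trend of the same sense when the character appears as a morpheme.

```python
from collections import Counter
from math import sqrt


def sense_frequency_by_period(records, periods):
    """Relative frequency of each sense in each period.

    records: list of (period, sense_label) pairs, one per labeled occurrence.
    Returns {sense_label: [relative frequency for each period, in order]}.
    """
    totals = Counter(period for period, _ in records)
    counts = Counter(records)
    senses = sorted({sense for _, sense in records})
    return {
        sense: [
            counts[(period, sense)] / totals[period] if totals[period] else 0.0
            for period in periods
        ]
        for sense in senses
    }


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Toy data (hypothetical): three periods, two senses "A" and "B".
periods = ["P1", "P2", "P3"]

# Occurrences of the character as a standalone monosyllabic word.
word_records = (
    [("P1", "A")] * 3 + [("P1", "B")] * 1
    + [("P2", "A")] * 2 + [("P2", "B")] * 2
    + [("P3", "A")] * 1 + [("P3", "B")] * 3
)

# Occurrences of the same character as a morpheme inside longer words.
morpheme_records = (
    [("P1", "A")] * 2 + [("P1", "B")] * 1
    + [("P2", "A")] * 1 + [("P2", "B")] * 1
    + [("P3", "A")] * 1 + [("P3", "B")] * 2
)

word_freq = sense_frequency_by_period(word_records, periods)
morpheme_freq = sense_frequency_by_period(morpheme_records, periods)

# Correlate the diachronic trend of sense "A" at the two levels.
r = pearson(word_freq["A"], morpheme_freq["A"])
print(f"sense A, word vs. morpheme trend: r = {r:.3f}")
```

In this toy data, sense A declines steadily at both the word level and the morpheme level, so the coefficient comes out close to 1. Real use would replace the hand-written records with the output of a sense identification step over a dated corpus.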