语法化
熵(时间箭头)
多样性(政治)
数学
自然语言处理
计算机科学
语言学
社会学
哲学
热力学
物理
人类学
标识
DOI:10.1080/09296174.2024.2395072
摘要
As the process of grammaticalization unfolds, it remains to be determined whether a word could co-occur with more words in contexts or would be restricted to fewer words. Based on the Lancaster Corpus of Mandarin Chinese (LCMC), this study examines the differences in colligation diversity between lexical and grammatical words in Chinese by using entropy, aiming to explore how the colligational behaviour of the left and right sides of Chinese words changes accordingly with increasing grammaticalization. The comparisons of colligation diversity between two sides and across word categories reveal that lexical words show quite similar levels of colligation diversity on the left, which makes them significantly different from grammatical words. More category-specific observations are disclosed by entropy-based approach. In the case of grammaticalization, an increase in entropy values denotes more types with a more uniform distribution, which is suggested to be the manifestation of semantic bleaching. Conversely, a decrease in entropy values may be an indicator of an increasing bondedness. The discussion on how grammaticalization affects the colligational behaviour of words should be based on the specific pathways of grammaticalization concerning word categories, as well as specific sides of words.
科研通智能强力驱动
Strongly Powered by AbleSci AI