变化(天文学)
弹性网正则化
计算机科学
人工智能
自然语言处理
回归
语义学(计算机科学)
回归分析
语言学
统计
数学
特征选择
机器学习
物理
哲学
天体物理学
程序设计语言
作者
Anthe Sevenants,Freek Van de Velde,Dirk Speelman
标识
DOI:10.1515/cllt-2024-0068
摘要
Abstract This article showcases elastic net regression as a means to build fairer models of morphosyntactic variation. Elastic net allows lexical items to appear on the same level as traditional, high-level predictors, enabling fuller models of variation. We apply elastic net regression to 1,296,574 Dutch verbal cluster tokens from the SoNaR corpus, analysing a morphosyntactic alternance in Dutch subordinate clauses. Our results show morphosyntactic preferences among verbs, indicating that semantic effects are indeed at play. Further analysis shows that semantic patterns for either word order exist, though it remains difficult to glean any semantic generalisations. Still, the elastic net technique shows that the inclusion of lexical items as full predictors in a model is useful, as much of the variation left unexplained by high-level predictors can be explained in lexical terms.
科研通智能强力驱动
Strongly Powered by AbleSci AI