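Variety (cybernetics)
Dispersion (optics)
Variable (mathematics)
Measure (data warehouse)
Linguistics
Computer science
Degree (music)
Range (aeronautics)
Econometrics
Psychology
Statistics
Artificial intelligence
Mathematics
Philosophy
Physics
Acoustics
Data mining
Optics
Composite material
Mathematical analysis
Materials science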
Identifier
DOI:10.1075/ijcl.13.4.02gri
Abstract
The most frequent statistics in corpus linguistics are frequencies of occurrence and frequencies of co-occurrence of two or more linguistic variables. However, such frequencies in isolation may sometimes be misleading since they do not take into consideration the degree of dispersion of the relevant linguistic variable. Many dispersion measures and adjusted frequency measures have been suggested but are neither widely known nor widely applied. Another unfortunate aspect of such measures is that many of them also come with a variety of problems. I pursue three objectives with this article. First, I want to raise awareness of this issue and make the available measures more widely known, so I present an overview of many measures of dispersion and adjusted frequencies. Second, I propose a conceptually simple alternative measure, DP, explain and exemplify it, and compare it to previously discussed measures. Third and most importantly, I urge corpus linguists to explore the notion of dispersion in more detail, and I outline a few proposals for which steps to take next.
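
The abstract names DP but does not define it. The following is a minimal sketch, assuming the definition commonly attributed to this article: half the sum of the absolute differences between each corpus part's share of the corpus (expected proportion) and its share of the word's occurrences (observed proportion). The function name, parameter names, and sample numbers are hypothetical and serve only as illustration.

```python
def deviation_of_proportions(part_sizes, part_counts):
    """Sketch of DP under the assumed definition (not taken from the abstract).

    part_sizes  -- number of tokens in each corpus part
    part_counts -- occurrences of the target word in each corpus part
    Returns a value from 0 (perfectly even spread) towards 1 (very uneven).
    """
    if len(part_sizes) != len(part_counts):
        raise ValueError("part_sizes and part_counts must have the same length")
    total_size = sum(part_sizes)
    total_count = sum(part_counts)
    if total_size == 0 or total_count == 0:
        raise ValueError("corpus size and word frequency must be non-zero")

    # Expected share: how much of the corpus each part makes up.
    expected = [size / total_size for size in part_sizes]
    # Observed share: how much of the word's occurrences fall in each part.
    observed = [count / total_count for count in part_counts]

    # Assumed formula: half the summed absolute differences.
    return 0.5 * sum(abs(o - e) for o, e in zip(observed, expected))


# Two hypothetical words with the same total frequency (20) in four equal parts.
evenly_spread = deviation_of_proportions([100, 100, 100, 100], [5, 5, 5, 5])
clumped = deviation_of_proportions([100, 100, 100, 100], [20, 0, 0, 0])
print(evenly_spread, clumped)
```

Both hypothetical words occur 20 times overall, yet the second is confined to a single part and receives a much higher score. This is the sense in which raw frequencies in isolation can mislead.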