按频率列出的单词列表
词(群论)
词汇判断任务
计算机科学
自然语言处理
德国的
字长
人工智能
语音识别
心理学
语言学
认知
哲学
神经科学
判决
作者
Marc Brysbaert,M. Büchmeier,Markus Conrad,Arthur M. Jacobs,Jens Bölte,Andrea Böhl
出处
期刊:Experimental psychology
[Hogrefe Publishing Group]
日期:2011-07-01
卷期号:58 (5): 412-424
被引量:439
标识
DOI:10.1027/1618-3169/a000123
摘要
We review recent evidence indicating that researchers in experimental psychology may have used suboptimal estimates of word frequency. Word frequency measures should be based on a corpus of at least 20 million words that contains language participants in psychology experiments are likely to have been exposed to. In addition, the quality of word frequency measures should be ascertained by correlating them with behavioral word processing data. When we apply these criteria to the word frequency measures available for the German language, we find that the commonly used Celex frequencies are the least powerful to predict lexical decision times. Better results are obtained with the Leipzig frequencies, the dlexDB frequencies, and the Google Books 2000–2009 frequencies. However, as in other languages the best performance is observed with subtitle-based word frequencies. The SUBTLEX-DE word frequencies collected for the present ms are made available in easy-to-use files and are free for educational purposes.
科研通智能强力驱动
Strongly Powered by AbleSci AI