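Mandarin
Discourse
Computer science
Telephone
Conversation
Mainland China
Pragmatics
Natural language processing
Corpus linguistics
Artificial intelligence
Linguistics
China
History
Philosophy
Archaeology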
Authors
Guodong Yu, Yanyan Wu, Paul Drew, Chase Wesley Raymond
Source
Journal: Chinese Language and Discourse
[John Benjamins Publishing Company]
Date: 2023-12-14
Identifier
DOI: 10.1075/cld.23001.guo
Abstract
This paper introduces the DMC Corpus, a newly collected dataset of 150 mundane cell phone calls from Mainland China in Mandarin Chinese (audio and detailed transcripts), which is now publicly available for use in research and teaching. In this report, we first describe the constitution and current contents of the DMC Corpus, as well as instructions for access. Additional calls will be added periodically to the Corpus, and so the quantitative overview presented here should be considered conservative. We then provide concrete examples of the sorts of phenomena that might be explored with these new data, underscoring how the Corpus offers researchers the ability to build systematic collections for analysis, whether they prefer to begin with ‘forms’ (e.g., utterance-final particles), with ‘functions’ (e.g., complaining), and/or with the temporal organization of interaction itself (e.g., preference organization, repair). The paper concludes with an explicit call for increased research on Mandarin conversation, to which we hope the materials in the DMC Corpus will contribute.