A multimedia corpus of child Mandarin: the Tong corpus
一个多媒体汉语普通话儿童语料库:同语料库
Xiangjun Deng 邓湘君; Virginia Yip 叶彩燕
Abstract 摘要
This article features a new multimedia corpus with 22 hours of recordings of a Mandarin-speaking child from the age of 1;7 to 3;4. We review the state of the art in the use of corpora for first language acquisition of Mandarin, and highlight the importance of corpus studies in evaluating children’s language developmental patterns vis-a-vis adult input. The transcripts in our new corpus are annotated with a morphological tier indicating parts of speech, and linked to audio or video files. This corpus goes beyond existing published corpora of child Mandarin in having more data for a single child, as well as media linking. It contributes to a number of fields including language acquisition, Chinese linguistics, corpus linguistics, developmental psycholinguistics, education, and speech and language therapy.
Subject Keywords 主题词
Child language corpus 儿童语料库 Mandarin Chinese 汉语普通话 Language input 语言输入
Media linking 多媒体链接 Morphological tier 词类注释层
Journal of Chinese Linguistics vol.46, no.1 (January 2018): 69-92
Copyright © 2018 Journal of Chinese Linguistices. All rights reserved.