An articulatory model of standard Chinese using MRI and X-ray movie
用磁共振成像和X光声道资料建立汉语普通话调音模型
Gaowu Wang 汪高武; Jiangping Kong 孔江平
Abstract 摘要
To better understand speech production from the phonological inputs to articulatory movements and then to acoustic outputs, it is important to establish an elaborate articulatory model of the vocal tract. This paper has explored the articulatory mechanism of speech production in Standard Chinese and developed a geometric articulatory model in both the visual and acoustic modalities.1 This model was based on the data of MRI images and X-ray movie, with the former providing detailed volumetric information of the vocal tract, and the latter the dynamic information of articulation. In this model, the seven articulators have been studied and modeled, including the hard palate, pharynx, jaw, lips, velum, tongue, and larynx. The tongue is modeled as two parts: tongue tip and tongue body, thus reducing the necessary number of parameters. The relation between larynx height and fundamental frequency in regard to the four tones is also modeled. These two improvements on tongue and larynx modeling have contributed new ideas to the articulatory modeling of Standard Chinese. This model can serve as a research tool for linguists, phoneticians, and speech engineers, and can be used in parameter speech synthesis, virtual speaker, and visual assistant speech training of Standard Chinese.
Keywords 关键词
Speech production 言语产生 Articulatory model 调音模型 Vocal tract 声道 MRI 磁共振成像 X-ray Movie X光