Comment on global Chinese corpus
汉语融合时代的语言研究与语料库
Cheng Hai Chew 周清海

Abstract 摘要
"The development of the modern Chinese language can be divided into two stages. The first stage, from 1949 to the reform and opening of China, is a stage of division. With the reform and opening of China, the modern Chinese language has entered into the stage of integration. The essential aspects, including the lexical and syntactic phenomena, of the Chinese language called “the National Language” before 1949 are still existent in the Chinese languages of various Chinese speech communities. However, because of social influences, the Chinese languages in various communities today have acquired different features. But not much attention has been paid to such variations and researches are few and far between. The Linguistic Variation of Chinese Speech Communities (LIVAC) Corpus initially developed by the Language Information Sciences Research Centre, City University of Hong Kong, has therefore attracted my special attention. This is a synchronous Chinese corpus. If properly and fully utilized, it can help to enhance the Chinese language teaching not just in Hong Kong, but also in other communities. The corpus can certainly play a more important role in the promotion of the Chinese language worldwide. The relevant institutes of Hong Kong and China should give it more support so that it can develop into a more comprehensive corpus of global relevance. The building of a corpus of the Chinese language at the divided stage and a separate one at the integration stage is of high significance and value. Our concern, however, should not be limited to the building of a corpus of modern Chinese language at the present time. I would argue that we should do more research on the Chinese language during the transitional period of pre-modern to the modern period which is lacking at the moment. Many new terms in modern Chinese language were in fact products of translations by western evangelists, rather than of Japanese origin. In order to change the worldview of the Chinese, western evangelists made great efforts to introduce to China western geographic knowledge as well as political idea. In such an attempt, they had to resort to coining some new terms in Chinese. In the past, because of the lack of a comprehensive picture, people had wrongly attributed many of these new coinages to the Japanese. In the study of early modern Chinese language, therefore, one must have a global perspective. The building of a corpus of early modern Chinese language will enable us to have a more comprehensive understanding of the Chinese language at the early modern stage.

现代汉语可以分为两个阶段。1949年到中国改革开放之前,是一个阶段,可以叫做现代汉语的分裂阶段。中国改革开放之后到现在,是现代汉语的融合阶段。1949年以前的“国语”,无论词汇或语法现象,都保留在各地的华语里。加上华语区多语社会的影响,使华语出现许多特点。这些部分我们以前都关心得不夠,研究也做得不多。我一直关心语言资讯科学研究中心的泛华语地区汉语共时语料库(LIVAC)。如果能充分利用这个共时语料库,将能推动香港的语文教学,甚至是其它地区的华文教学。在汉语的世界传播方面,这个语料库将能起到更大的作用。香港及中国的相关单位,应该给这个语料库更大的支持,让它发展成为世界性的,涵盖面最广的语料库。以汉语的分裂和融合来分别建立语料库,也是非常有意义的,有价值的。我们不应该只为现代汉语的现况建立语料库。从近代汉语过渡到现代汉语,也有许多研究不足的地方。现代汉语的新词有很多是传教士翻译的,而不是从日本传入的。传教士要把西方的地理知识、政治知识介绍到中国来,以期改变中国人以中国为世界中心的观念,不得不创造汉语新词。我们对过去的了解不够,而误把新词的创造权归给了日本。对早期现代汉语的研究,需要有世界眼光。为早期现代汉语建立语料库,能让我们更全面地了解早期现代汉语的情况。

Keywords 关键词

Integration of the Chinese language 汉语融合 Corpus 语料库 Ancient and modern Chinese 古今汉语 Foreign loanwords in Chinese 汉语外来词

Article 文章

<< Back 返回

Readers 读者