Dynamic attribute and change of Chinese lexicon
汉语词汇动态属性与变异

Chin-Chuan Cheng 郑锦全

Abstract 摘要
This study makes a distinction between word and lexicon. A word is an individual lexical item. The lexicon is an aggregate of words. Past discussions of differences among lexicons could only list individual words for comparison. In this study, words of digital texts in Old Chinese, Pre-Modern Chinese, Modern Chinese, ancient poetic writings and modern press releases were examined and a small number of significant lexical elements were extracted to capture the distinct nature of the lexicons. The crucial point of distinction is how words are used and not whether particular words exit. Therefore, the lexical attribute discussed here is called dynamic attribute. The words in the text streams were tabulated for their frequency and percentage of the occurrence with respect to the entire texts. The cumulative percentage of occurrences of the 15 highest frequency words was also tabulated. It was then used as the concentration level of high-frequency words in use. This concentration level clearly differentiates the types of texts used in Old Chinese, Pre-Modern Chinese, Modern Chinese, ancient poetic writings and modern press releases. It is hoped that this quantitative lexical attribute may be of some use in future lexicological research.

本文区分词语和词汇,词语是个别词,词汇是词语的集合体。过去讨论不同词汇的差异都只能列举词语的异同,本文考查上古汉语、近代汉语、现代汉语、古代韵文以及现代汉语新闻稿文字,从中提炼出少数词汇要素。区别的关键在于词语的使用而不在于词语的有无。因此,本文所提出的从最高词频往下累积,得出词频在全部词语数目中的百分比。我们以词频统计中最高的15个词语的词频累积百分比作为高频词集中度,以高频词集中度当作词汇动态特性。从文本计算出来的词汇动态特性能清楚划分出所考查的古代、近代、现代、古代韵文和现代新闻稿的词汇属性。希望这个计量性质的词汇属性对今后的词汇研究有些助益。

Keywords 关键词

Word and lexicon 词语与词汇 Lexical dynamic attribute 词汇动态属性 Cumulative frequency percentage 词频累积百分比 Concentration level of high-frequency words 高频词集中度


Journal of Chinese Linguistics Monograph Series (ISSN 2409-2878), Number 27 (2017): 47-66
Copyright © 2017 Journal of Chinese Linguistices. All rights reserved.

Article 文章

<< Back 返回

Readers 读者