The construction and application of mandarin Chinese multi-level annotated corpus
现代汉语多级标注语料库的建设及其应用
Huiming Duan 段慧明; Xuefeng Zhu 朱学锋; Shiwen Yu 俞士汶; Yunfang Wu 吴云芳
Abstract 摘要
The Comprehensive Language Knowledge Base (CLKB) which has been under construction by the authors and the colleagues of Institute of Computational Linguistics at Peking University since 1986. Mandarin Chinese multi-level annotated corpus is one of the important language knowledge bases of CLKB. After a brief introduction of CLKB, this paper describes the leading ideas, the achievement and application of our multi-level annotated corpus.
Keywords 关键词
Computational linguistics 计算语言学 Chinese information processing 中文信息处理 Comprehensive Language Knowledge Base 综合型语言知识库 Mandarin Chinese Multi-level Annotated Corpus 现代汉语多级标注语料库 Grammatical Knowledge-base of Contemporary Chinese 现代汉语语法信息词典