The construction and application of a Mandarin Chinese multi-level annotated corpus
现代汉语多级标注语料库的建设及其应用
Huiming Duan 段慧明; Xuefeng Zhu 朱学锋; Shiwen Yu 俞士汶; Yunfang Wu 吴云芳

Abstract 摘要
The Comprehensive Language Knowledge Base (CLKB) has been under construction by the authors and their colleagues at the Institute of Computational Linguistics at Peking University since 1986. The Mandarin Chinese multi-level annotated corpus is one of the important language knowledge bases in CLKB. After a brief introduction to CLKB, this paper describes the guiding ideas, achievements, and applications of our multi-level annotated corpus.

本文作者与北京大学计算语言学研究所(ICL/PKU)的同仁一道,自1986年起积25年之努力建成“综合型语言知识库”(简称CLKB)。现代汉语多级标注语料库是CLKB中的一项重要的语言知识库。本文在介绍CLKB的概要之后,论述ICL/PKU研制多级标注语料库的理念、已经取得的成果及其应用情况。

Keywords 关键词

Computational linguistics 计算语言学; Chinese information processing 中文信息处理; Comprehensive Language Knowledge Base 综合型语言知识库; Mandarin Chinese Multi-level Annotated Corpus 现代汉语多级标注语料库; Grammatical Knowledge-base of Contemporary Chinese 现代汉语语法信息词典
