Chinese network media monitoring corpus and empirical investigation of language use on blog
汉语网络媒体监测语料库及博客语言应用调查
Tingting He 何婷婷; Xinhui Tu 涂新辉

Abstract 摘要
Chinese network media monitoring corpus, which includes network news, blog and forum texts, has been maintained by network media branch of national language resources monitoring and research center since 2005. Based on the corpus’s blog texts in 2009 and 2010, we compare some characteristics of language use for famous blog users and general blog users. And we also conduct a comparative analysis of characteristics of language use on female users and male users.

"汉语网络媒体监测语料库由国家语言资源监测与研究中心网络媒体语言分中心1 2005年开始建设,包含网络新闻、博客、论坛的语料。基于该语料库2009年、2010年的博客语料的统计数据,对比分析了一般博客用户和名博在博客发帖量、用字用语上的特点;基于不同性别的作者的博客文本,对比分析了男、女性作者在用字用词上的特点。

Keywords 关键词

Corpus 语料库 Blog 博客 Language use survey 语言调查 Gender linguistics 性别语言 Frequency ratio 频率比值

Article 文章

<< Back 返回

Readers 读者