Chinese network media monitoring corpus and empirical investigation of language use on blogs
汉语网络媒体监测语料库及博客语言应用调查
Tingting He 何婷婷; Xinhui Tu 涂新辉
Abstract 摘要
The Chinese network media monitoring corpus, which includes network news, blog, and forum texts, has been maintained by the network media branch of the National Language Resources Monitoring and Research Center since 2005. Based on the corpus’s blog texts from 2009 and 2010, we compare characteristics of language use between famous blog users and general blog users. We also conduct a comparative analysis of language use by female and male users.
Keywords 关键词
Corpus 语料库; Blog 博客; Language use survey 语言调查; Gender linguistics 性别语言; Frequency ratio 频率比值