Extracting Chinese multi-word units from large-scale balanced corpus

首页 > 成果 > 详情

认领

导出

反馈

作者信息关键词期刊信息归属信息摘要

成果类型：

期刊论文

作者：

Liu, JZ;He, TT（何婷婷）;Xiaohua, LH

作者机构：

[He, TT; Xiaohua, LH; Liu, JZ] Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China.

语种：

英文

期刊：

PACLIC 17: Language, Information and Computation, Proceedings

年：

2003

页码：

282-289

机构署名：

本校为第一机构

院系归属：

计算机学院

摘要：

Automatic Multi-word Units Extraction is an important issue in Natural Language Processing. This paper has proposed a new statistical method based on a large-scale balanced corpus to extract multi-word units. We have used two improved traditional parameters: mutual information and log-likelihood ratio, and have increased the precision for the top 10,000 words extracted through the method to 80.13%. The results of the research indicate that this method is more efficient a...

反馈

产权有误：本人成果被他人认领

数据有误：数据基本信息有误

归属有误：成果的院系归属、机构署名归属有误

其他原因：

验证码：

看不清楚，换一个

确定

取消

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限，请直接登录访问

如果您没有访问权限，请联系管理员申请开通

管理员联系邮箱：yun@hnwdkj.com