
Measuring bilingual corpus comparability

Type:
Journal article
Authors:
Li, Bo*; Gaussier, Eric; Yang, Dan
Corresponding author:
Li, Bo
Author affiliations:
[Li, Bo] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.
[Gaussier, Eric] Univ Grenoble Alpes, CNRS, LIG, AMA, Grenoble, France.
[Yang, Dan] China Elect Power Res Inst, Wuhan, Hubei, Peoples R China.
Corresponding affiliation:
[Li, Bo] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.
Language:
English
Keywords:
Software engineering;Bilingual corpora;Bilingual dictionary;Bilingual lexicon extractions;Comparable corpora;Gold standards;Real-world;Under-resourced languages;Artificial intelligence
Journal:
NATURAL LANGUAGE ENGINEERING
ISSN:
1351-3249
Year:
2018
Volume:
24
Issue:
4
Pages:
523-549
Funding:
This work was co-supported by the Natural Science Foundation of China (Nos. 61300144 and 61572223), the State Language Commission of China (No. YB125-132), the Humanity and Social Science Foundation of the Ministry of Education of China (No. 15YJC870029) and the Fundamental Research Funds for Central Universities (Nos. CCNU16A06015, CCNU15A05062, CCNU17GF0005, CCNUSZ2017024).
Author contacts:
Li Bo, Department of Computer Science, Central China Normal University, Wuhan, China, e-mail: libo@mail.ccnu.edu.cn
Eric Gaussier, CNRS-LIG/AMA, Université Grenoble Alpes, Grenoble, France, e-mail: eric.gaussier@imag.fr
Yang Dan, China Electric Power Research Institute, Wuhan, China, e-mail: yangdan3@epri.sgcc.com.cn
Copyright:
Copyright © Cambridge University Press 2018
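Institutional attribution:
This university is the first and corresponding institution
Department:
School of Computer Science
Abstract: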
Comparable corpora serve as an important substitute for parallel resources for under-resourced language pairs. Previous work mostly aims to find better strategies for exploiting existing comparable corpora, while ignoring the variation in corpus quality. The quality of a comparable corpus strongly affects its usability in practice, a fact that several studies have confirmed. However, researchers have not been able to establish a widely accepted and fully validated framework to measure corpus quality. We will thus investigate in this paper a comprehensive methodology to deal with the quali...
