一种基于语义匹配的Web信息提取方法研究

首页 > 成果 > 详情

认领

导出

Link by 中国知网学术期刊 Link by 万方学术期刊

反馈

作者信息关键词期刊信息基础信息归属信息摘要

成果类型：

期刊论文

论文标题(英文)：

An Information Extraction Based on Semantic Matching for Web Pages

作者：

张茂元;邹春燕;卢正鼎

作者机构：

[张茂元; 卢正鼎] 华中科技大学计算机科学与技术学院

华中科技大学管理学院

[邹春燕] 华中师范大学外国语学院

语种：

中文

关键词：

信息提取;语义;匹配

期刊：

计算机工程与应用

ISSN：

1002-8331

年：

2006

卷：

期：

页码：

141-143

DOI：

10.3321/j.issn:1002-8331.2006.23.043

基金类别：

国家自然科学基金

机构署名：

本校为其他机构

院系归属：

外国语学院

摘要：

为了较好地解决信息过量难以消化、汉语词的歧义划分、Web信息形式不一致并且难以辨识的问题,文章提出了一种基于语义匹配的Web信息提取方法.该方法融合了网页分类、汉语分词、语义信息匹配方法,并给出了一种义素相似度,进而提出了一种基于语义的信息匹配方法来识别和提取网页信息项.基于这种Web信息提取方法的网上药品信息监管系统Web-MIND能够提取出网上药品广告的信息项,并具有较高的准确率.

摘要(英文)：

Some problems exist in all these Web information,for example：difficulty in processing excessive information,Chinese word segmentation for the ambiguous words,the information of variable formats,and the recognition of information.In order to solve those problems,an information extraction of web pages based on semantic matching is proposed in this paper.The extraction method integrates the classification method of Web pages,segmentation method of Chinese words and semantic-matching method of information.Moreover,the extraction method proposes a ...

反馈

产权有误：本人成果被他人认领

数据有误：数据基本信息有误

归属有误：成果的院系归属、机构署名归属有误

其他原因：

验证码：

看不清楚，换一个

确定

取消

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

一种基于语义匹配的Web信息提取方法研究

反馈

成果认领

提示

该栏目需要登录且有访问权限才可以访问