何婷婷 - 计算机学院 - 学者 - 华中师范大学智汇云

首页 > 学者 > 学者详情

何婷婷

博士

计算机学院

研究方向：从事网络媒体监测；自然语言处理；信息检索；数据库应用

个人成果详细资料

QQ 微信微博

认领成果

疑似成果

成果类型

请选择成果类型

全部

期刊论文

会议论文

专利

项目

筛选

开始检索

已无可筛选条件

成果类型

期刊论文

191

会议论文

项目

专利

年份 (1996~2021)

年

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

语种

英文

168

中文

期刊

Lecture Notes in Computer Science

计算机科学

2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

中文信息学报

BMC Bioinformatics

IEEE Transactions on NanoBioscience

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS

Methods

PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE

PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

计算机工程

2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2

2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

Frontiers in Genetics

IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING

Information-An International Interdisciplinary Journal

International Conference on Information and Knowledge Management, Proceedings

Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05)

作者

何婷婷

Tingting He

He, Tingting

He, T.

He, Tingting

Hu, Xiaohua

Jiang, Xingpeng

Xiaohua Hu

Hu, X.

Xingpeng Jiang

He, Tingting

Shen, X.

Yang, Jincai

He Tingting

Jiang, X.

Jiang, Xingpeng

Shen, Xianjun

Hu, Xiaohua

Xianjun Shen

Zhou, Guangyou

关键词

protein complexes

自然语言处理

词义消歧

Microbiome

中文信息处理

Information retrieval

Microbial interactions

Text mining

information retrieval

microbiome

自动文摘

计算机应用

Feature Selection

biomarkers

deep learning

microbiome

Algorithm CACE

Algorithms

Bioinformatics databases

Biological network

机构署名

本校为第一机构

193

本校为通讯机构

138

本校为第一且通讯机构

136

本校为其他机构

院系归属

计算机学院

193

国家数字化学习工程技术研究中心

信息管理学院

教育信息技术学院

生命科学学院

职业与继续教育学院

数学与统计学学院

教育学院

排序：

时间

7/12

每页显示条

请选择

共223条记录，

Manifold learning reveals nonlinear structure in metagenomic profiles

作者： Jiang, Xingpeng（蒋兴鹏）;Hu, Xiaohua;Shen, Huiyu;He, Tingting（何婷婷）

期刊： ,2012年:5-10

通讯作者： Jiang, X.

作者机构： College of Information Science and Technology, Drexel University, Philadelphia, PA, United States;Central China Normal University, Wuhan, Hubei, China

通讯机构： College of Information Science and Technology, Drexel University, United States

会议名称： IEEE International Conference on Bioinformatics and Biomedicine

会议时间： 2012-10-04

会议地点： Philadelphia, PA(US)

会议论文集名称： 2012 IEEE international conference on bioinformatics and biomedicine

关键词： Isomap;Nonlinear dimension reduction;metagenomic profile;non-negative matrix factorization;principle component analysis

摘要： Using metagenomics to detect the global structure of microbial community remains a significant challenge. The structure of a microbial community and its functions are complicated not only because of the complex interactions among microbes but also their complicate interacting with confounding environmental factors. Recently dimension reduction methods such as Principle component analysis, Non-negative matrix factorization and Canonical correlation analysis have been employed extensively to investigate the complex structure embedded in metagenomic profiles which summarize the abundance of functional or taxonomic categorizations in metagenomic studies. However, metagenomic profiles are not necessary to meet the &#x0022;Assumption of Linearity&#x0022; behind these methods. Therefore it is worth to investigate how nonlinear methods can be utilized in metagenomic studies. In this paper, a nonlinear manifold learning method- Isomap is used to visualize and analyze large-scale metagenomic profiles. Isomap was applied on a large-scale Pfam profile which are derived from 45 metagenomes in Global Ocean Sampling expedition. In our result, a novel nonlinear structure of protein families is identified and the relationships among the identified nonlinear components and environmental factors of global ocean are explored. The results indicate the strength of nonlinear methods in learning the complex microbial structure. With the coming of the huge number of new sequenced metagenomes, nonlinear methods like Isomap could be necessary complementary tools to current widely used methods.

语种：英文

展开

导出

原文链接

认领

Incorporating word correlation into tag-topic model for semantic knowledge acquisition

作者： Fang Li;Tingting He（何婷婷）;Xinhui Tu;Xiaohua Hu

期刊： ACM International Conference Proceeding Series,2012年:1622-1626

通讯作者： Li, F.

作者机构： Department of Computer Science, Central China Normal University, Wuhan, China;National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, China;College of Information Science and Technology, Drexel University, Philadelphia, PA, United States

通讯机构： National Engineering Research Center for E-Learning, Central China Normal University, China

会议名称： Proceedings of the 21st ACM international conference on Information and knowledge management

关键词： blog;dirichlet forest prior;tag;topic model

摘要： This paper presents a tag-topic model with Dirichlet Forest prior (TTM-DF) for semantic knowledge acquisition from blog. The TTM-DF model extends the tag-topic model (TTM) by replacing the Dirichlet prior with the Dirichlet Forest prior over the topic-word multinomial. The correlation between words are calculated to generate a set of Must-Links and Cannot-Links, then the structures of Dirichlet trees are obtained though encoding the constraints of Must-Links and Cannot-Links. Words under the same subtrees are expected to be more correlated than words under different subtrees. We conduct experiments on a synthetic and a blog dataset. Both of the experimental results show that the TTM-DF model performs much better than the TTM model. It can improve the coherence of the underlying topics and the tag-topic distributions, and capture semantic knowledge effectively. © 2012 ACM.

语种：英文

展开

导出

原文链接

认领

Estimating functional groups in human gut microbiome with probabilistic topic models

作者： Chen, Xin;He, TingTing*（何婷婷）;Hu, Xiaohua;Zhou, Yanhong;An, Yuan;...

期刊： IEEE Transactions on NanoBioscience,2012年11(3):203-215 ISSN：1536-1241

通讯作者： He, TingTing（何婷婷）

作者机构： [Chen, Xin; Hu, Xiaohua; An, Yuan] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.;[He, TingTing] Cent China Normal Univ, Dept Comp Sci, Wuhan, Peoples R China.;[Zhou, Yanhong] Huazhong Univ Sci & Technol, Sch Life Sci & Technol, Wuhan, Peoples R China.;[Wu, Xindong] Univ Vermont, Dept Comp Sci, Burlington, VT USA.

通讯机构： [He, TingTing] C;Cent China Normal Univ, Dept Comp Sci, Wuhan, Peoples R China.

关键词： Bioinformatics databases;biological data mining;metagenomics;probabilistic topic model

摘要： In this paper, based on the functional elements derived from non-redundant CDs catalogue, we show that the configuration of functional groups in meta-genome samples can be inferred by probabilistic topic modeling. The probabilistic topic modeling is a Bayesian method that is able to extract useful topical information from unlabeled data. When used to study microbial samples (assuming that relative abundance of functional elements is already obtained by a homology-based approach), each sample can be considered as a document, which has a mixture of functional groups, while each functional group (also known as a latent topic) is a weight mixture of functional elements (including taxonomic levels, and indicators of gene orthologous groups and KEGG pathway mappings). The functional elements bear an analogy with words. Estimating the probabilistic topic model can uncover the configuration of functional groups (the latent topic) in each sample. The experimental results demonstrate the effectiveness of our proposed method. © 2002-2011 IEEE.

语种：英文

展开

导出

原文链接

认领

Tag-topic model for semantic knowledge acquisition from blogs

作者： Li, Fang;Shen, Huiyu;He, Tingting（何婷婷）

期刊： NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering,2011年:221-226

通讯作者： Shen, H.

作者机构： [Li, Fang] Engineering and Research Center for Information Technology on Education, Huazhong Normal University, Wuhan, China;[He, Tingting] Department of Computer Science, Huazhong Normal University, Wuhan, China;[Shen, Huiyu] Huazhong Normal University Press, Huazhong Normal University, Wuhan, China

通讯机构： Huazhong Normal University Press, Huazhong Normal University, China

关键词： Perplexity;Semantic Knowledge Acquisition;Tag;Topic Model

摘要： This paper proposed a tag-topic model for semantic knowledge acquisition from blogs. The model extends the Latent Dirichlet Allocation by adding a tag layer between the document and topic layer, it represents each document with a mixture of tags, each tag is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words. After parameters estimating, the tags are regarded as concepts, the top words arranged to the top topics are selected as related words of the concepts, and PMI-IR is utilized for filtering out noisy words to improve the quality of the semantic knowledge. Experimental results show that the tag-topic model can effectively capture semantic knowledge. ©2011 IEEE.

语种：英文

展开

导出

原文链接

认领

基于LDA模型的文本聚类研究

作者：董婧灵;李芳;何婷婷（何婷婷）;涂新辉;万剑

作者机构： [何婷婷; 董婧灵; 李芳; 涂新辉; 万剑] 华中师范大学计算机科学与技术系;[何婷婷; 董婧灵; 李芳; 涂新辉; 万剑] 国家语言资源监测与研究中心网络媒体语言分中心

会议名称：第十一届全国计算语言学学术会议

会议时间： 2011-08-20

会议地点：洛阳

会议论文集名称：第十一届全国计算语言学学术会议论文集

关键词：主题模型;文本聚类

摘要： LDA(Latent Dirichlet Allocation)是近年来提出的一种具有文本主题表示能力的非监督学习模型。本文提出了一种基于LDA主题模型的文本聚类和聚簇描述方法。利用LDA模型挖掘隐藏在文本内的不同主题与词之间的关系,得到文本的主题分布;并将此分布作为特征融入到传统的向量空间模型来计算相似度进而对文本进行聚类;再利用主题信息对聚类结果进行聚簇描述。实验结果表明本文的方法能够明显地提

语种：中文

展开

导出

原文链接

认领

Perspective hierarchical dirichlet process for user-tagged image modeling

作者： Chen, Xin;Hu, Xiaohua;An, Yuan;Xiong, Zunyan;He, Tingting（何婷婷）;...

期刊： International Conference on Information and Knowledge Management, Proceedings,2011年:1341-1346

通讯作者： Chen, X.(bruce.chen@drexel.edu)

作者机构： [Xiong, Zunyan; Chen, Xin; Hu, Xiaohua; An, Yuan] College of Information Science and Technology, Drexel University, Philadelphia, PA 19104, United States;[Park, E.K.] California State University - Chico, Chico, CA 95929, United States;[He, Tingting] Dept. of Computer Science, Central China Normal University, Wuhan, China

摘要： In this paper, we proposed a perspective Hierarchical Dirichlet Process (pHDP) model to deal with user-tagged image modeling. The contribution is two-fold. Firstly, we associate image features with image tags. Secondly, we incorporate the user's perspectives into the image tag generation process and introduce new latent variables to determine if an image tag is generated from user's perspectives or from the image content. Therefore, the model is able to extract both embedded semantic components and user's perspectives from user-tagged images. Based on the proposed pHDP model, we achieve automatic image tagging with users' perspective. Experimental results show that the pHDP model achieves better image tagging performance compared to state-of-the-art topic models. ©2011 ACM.

语种：英文

展开

导出

原文链接

认领

基于树型正交前向选择方法的可调核函数模型

作者：张猛;付丽华;何婷婷（何婷婷）;魏志成

期刊： 信号处理,2011年27(10):1576-1580 ISSN：1003-0530

作者机构： [何婷婷; 张猛] 华中师范大学计算机科学系;[付丽华] （武汉）中国地质大学数学与物理学院;[魏志成] 河北师范大学物理科学与信息工程学院

会议名称：第十五届全国信号处理学术年会

会议时间： 2011-11-17

会议地点：北京

会议论文集名称：第十五届全国信号处理学术年会论文集

关键词：正交前向选择;核函数模型;树型搜索

摘要：基于留一准则的正交前向选择算法(Orthogonal Forward Selection based on Leave-One-Out Criteria,OFS-LOO)是最近提出的一种数据建模方法,它能够产生鲁棒性好的参数可调的核函数回归模型。OFS-LOO采用贪婪算法策略,利用全局优化算法逐项调节每个回归项的参数,逐步地增加模型的项数,减少留一准则函数值。但是OFS-LOO仅保留当前最优解作为新回归项的参数,而忽略当前的选择对以后步骤的影响,破坏了模型的稀疏性。本文在OFS-LOO的框架下提出了一种新颖的树型算法。在选择核函数模型的每一项时,采用重复加权增进搜索(Repeated Weighted Boosting Search,RWBS)算法,同时保留RWBS得到的多个局部极值作为核函数参数的候选项。新方法试图找到传统OFS-LOO和全局最优解之间的折衷。实验表明,与传统方法相比,新方法得到的核函数模型稀疏性更好,泛化能力更强。

语种：中文

展开

导出

原文链接

认领

Boosting Naive Bayes Text Categorization by Using Cloud Model

作者： Wan, Jian*;He, Tingting（何婷婷）;Chen, Jinguang;Dong, Jinling

作者机构： [He, Tingting; Dong, Jinling; Wan, Jian] Huazhong Normal Univ, Dept Comp Sci & Technol, Wuhan 430079, Peoples R China.;[Chen, Jinguang] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China.;[Chen, Jinguang] Huzhou Teachers Coll, Sch Teacher Educ, Wuhan 313000, Peoples R China.

会议名称： International Conference on Computer, Electrical, and Systems Sciences, and Engineering

会议时间： APR 10-11, 2011

会议地点： Wuhan, PEOPLES R CHINA

会议主办单位： [Wan, Jian;He, Tingting;Dong, Jinling] Huazhong Normal Univ, Dept Comp Sci & Technol, Wuhan 430079, Peoples R China.^[Chen, Jinguang] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China.^[Chen, Jinguang] Huzhou Teachers Coll, Sch Teacher Educ, Wuhan 313000, Peoples R China.

关键词： Naive Bayes;Cloud Model;Feature Selection;Text Categorization

摘要： This paper presents a method which improves effectiveness of Naive Bayes text categorization by using cloud model. The traditional Naive Bayes text categorization directly uses term frequency to describe the relationship between words and categories. In deed, there are many words with high frequency do not have a close relevance with the category. To solve this problem, we introduce cloud model theory into Naive Bayes text classification and build a new feature selection system. By using numerical characteristics of cloud, we obtain more representative features. Experimental results on 20 Newsgroups show that our method can improve accuracy of text categorization remarkably.

语种：英文

展开

导出

认领

Query-focused multi-document summarization using cloud model

作者： Chen, Jinguang;He, Tingting*（何婷婷）

期刊： Information-An International Interdisciplinary Journal,2011年14(3):951-956 ISSN：1343-4500

通讯作者： He, Tingting

作者机构： [He, Tingting; Chen, Jinguang] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China.;[He, Tingting] Huazhong Normal Univ, Dept Comp Sci & Technol, Wuhan 430079, Peoples R China.;[Chen, Jinguang] Huzhou Teachers Coll, Sch Teacher Educ, Huzhou 313000, Peoples R China.

通讯机构： [He, Tingting] H;Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China.

会议名称： 2011 International Conference on Intelligent Computing and Information Science (ICICIS 2011)

会议时间： JAN 09-10, 2011

会议地点： Chonqqing, PEOPLES R CHINA

会议主办单位： [Chen, Jinguang;He, Tingting] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China.^[He, Tingting] Huazhong Normal Univ, Dept Comp Sci & Technol, Wuhan 430079, Peoples R China.^[Chen, Jinguang] Huzhou Teachers Coll, Sch Teacher Educ, Huzhou 313000, Peoples R China.

关键词： Cloud model;Query-focused multi-document summarization;Text summarization;Uncertainty

摘要： This paper presents a method called CloudSum which improves effectiveness of query-focused multi-document summarization by handling uncertainties with cloud model. In CloudSum, three cloud models are defined corresponding to the key phases of summarization, contributions of words and sentences are obtained by taking into account both fuzziness and randomness of their basic distribution, instead of considering the later only as in statistical method. Experiments on the DUC2005, DUC2006, DUC2007, TAC2008, TAC2009 corpuses show that our method achieves results comparable to the best available systems. CloudSum also achieves good results in participating the just ended TAC 2010. © 2011 International Information Institute.

语种：英文

展开

导出

认领

User-oriented summary extraction for soccer video based on multimodal analysis

作者： Liu Huayong*;Jiang Shanshan;He Tingting（何婷婷）

期刊： Proceedings of SPIE - The International Society for Optical Engineering,2011年8004:1-8 ISSN：0277-786X

通讯作者： Liu Huayong

作者机构： [He Tingting; Liu Huayong; Jiang Shanshan] Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China.

通讯机构： [Liu Huayong] C;Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China.

会议名称：第七届多光谱图象处理与模式识别国际学术会议

会议时间： 2011-11-01

会议地点：桂林

会议主办单位： [Liu Huayong;Jiang Shanshan;He Tingting] Cent China Normal Univ, Dept Comp Sci, Wuhan 430079, Peoples R China.

会议论文集名称：第七届多光谱图象处理与模式识别国际学术会议论文集

关键词： video summary;soccer video;user-oriented model;highlight extraction;multimodal analysis

摘要： An advanced user-oriented summary extraction method for soccer video is proposed in this work. Firstly, an algorithm of user-oriented summary extraction for soccer video is introduced. A novel approach that integrates multimodal analysis, such as extraction and analysis of the stadium features, moving object features, audio features and text features is introduced. By these features the semantic of the soccer video and the highlight mode are obtained. Then we can find the highlight position and put them together by highlight degrees to obtain the video summary. The experimental results for sports video of world cup soccer games indicate that multimodal analysis is effective for soccer video browsing and retrieval.

语种：英文

展开

导出

原文链接

认领

Inferring Functional Groups from Microbial Gene Catalogue with Probabilistic Topic Models

作者： Chen, Xin*;He, TingTing（何婷婷）;Hu, Xiaohua;An, Yuan;Wu, Xindong

期刊： 2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011),2011年:3-9 ISSN：2156-1125

通讯作者： Chen, Xin

作者机构： [Chen, Xin; Hu, Xiaohua; An, Yuan] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.;[He, TingTing] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.;[Wu, Xindong] Univ Vermont, Dept Comp Sci, Burlington, VT USA.

通讯机构： [Chen, Xin] D;Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.

会议名称： IEEE International Conference on Bioinformatics & Biomedicine (BIBM 2011)

会议时间： 2011-01-01

会议地点： Atlanta, Georgia, USA

会议主办单位： [Chen, Xin;Hu, Xiaohua;An, Yuan] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.^[He, TingTing] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.^[Wu, Xindong] Univ Vermont, Dept Comp Sci, Burlington, VT USA.

会议论文集名称： 2011 IEEE International Conference on Bioinformatics and Biomedicine

关键词： Bioinformatics databases;Biological data mining;Metagenomics;Probabilistic topic model

摘要： In this paper, based on the functional elements derived from non-redundant CDs catalogue, we show that the configuration of functional groups in meta-genome samples can be inferred by probabilistic topic modeling. The probabilistic topic modeling is a Bayesian method that is able to extract useful topical information from unlabeled data. When used to study microbial samples (assuming that relative abundance of functional elements is already obtained by a homology-based approach), each sample can be considered as a 'document', which has a mixture of functional groups, while each functional group (also known as a 'latent topic') is a weight mixture of functional elements (including taxonomic levels, and indicators of gene orthologous groups and KEGG pathway mappings). The functional elements bear an analogy with 'words'. Estimating the probabilistic topic model can uncover the configuration of functional groups (the latent topic) in each sample. The experimental results demonstrate the effectiveness of our proposed method.

语种：英文

展开

导出

原文链接

认领

Improving effectiveness of naïe bayes text classification by introducing cloud model

作者： Chen, Jinguang;Wan, Jian;He, Tingting（何婷婷）

期刊： Journal of Computational Information Systems,2011年7(13):4963-4971 ISSN：1553-9105

通讯作者： He, T.(tthe@ccnu.edu.cn)

作者机构： [Chen, Jinguang] Engineering and Research Center for Information Technology on Education, Huazhong Normal University, Wuhan 430079, China;[Chen, Jinguang] School of Teacher Education, Huzhou Teachers College, Huzhou 313000, China;[He, Tingting; Wan, Jian] Department of Computer Science and Technology, Huazhong Normal University, Wuhan 430079, China

通讯机构： Department of Computer Science and Technology, Huazhong Normal University, China

关键词： Cloud model;Naïe bayes;Text classification

摘要： By introducing cloud model, this paper presents a method which improves effectiveness of Nai&die;e Bayes text classification. The traditional Nai&die;e Bayes text categorization directly uses term frequency to describe the relationship between words and categories. In deed, there are many words with high frequency do not have a close relevance with the category. To solve this problem, we introduce cloud model theory into Nai&die;e Bayes text classification and build a new feature selection system. By using numerical characteristics of cloud, we obtain more representative features. Experimental results on 20 Newsgroups show that our method can improve accuracy of text categorization remarkably. ©2011 Binary Information Press December, 2011.

语种：英文

展开

导出

认领

Research on sentiment classification of blog based on PMI-IR

作者： Xiuting DUAN;Tingting HE（何婷婷）;Le SONG

期刊： Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2010,2010年:1-6

通讯作者： Duan, X.(abc381858424@163.com)

作者机构： [Tingting HE; Le SONG; Xiuting DUAN] Department of Computer Science, Huazhong Normal University, Wuhan, Hubei, China

通讯机构： Department of Computer Science, Huazhong Normal University, China

会议名称： The 6th International Conference on Natural Language Processing and Knowledge Engineering(第六届IEEE自然语言处理与知识工程国际会议 NLP-KE 2010)

会议时间： 2010-08-21

会议地点：北京

会议论文集名称： The 6th International Conference on Natural Language Processing and Knowledge Engineering(第六届IEEE自然语言处理与知识工程国际会议 NLP-KE 2010)论文集

关键词： Mutual information;PMI-IR algorithm;Semantic classification

摘要： Development of Blog texts information on the internet has brought new challenge to Chinese text classification. Aim to solving the semantics deficiency problem in traditional methods for Chinese text classification, this paper implements a text classification method on classifying a blog as joy, angry, sad or fear using a simple unsupervised learning algorithm. The classification of a blog text is predicted by the max semantic orientation (SO) of the phrases in the blog text that contains adjectives or adverbs. In this paper, the SO of a phrase is calculated as the mutual information between the given phrase and the polar words. Then the SO of the given blog text is determined by the max mutual information value. A blog text is classified as joy if the SO of its phrases is joy. Two different corpora are adopted to test our method, one is the Blog corpus collected by Monitor and Research Center for National Language Resource Network Multimedia Sub-branch Center, and the other is Chinese dataset provided by COAE2008 task. Based on the two datasets, the method respectively achieves a high improvement compared to the traditional methods. ©2010 IEEE.

语种：英文

展开

导出

原文链接

认领

Answer diversification for complex question answering on the web

作者： Achananuparp, Palakorn*;Hu, Xiaohua;He, Tingting（何婷婷）;Yang, Christopher C.;An, Yuan;...

期刊： Lecture Notes in Computer Science,2010年6118(PART 1):375-382 ISSN：0302-9743

通讯作者： Achananuparp, Palakorn

作者机构： [Achananuparp, Palakorn; Hu, Xiaohua; Guo, Lifan; An, Yuan; Yang, Christopher C.] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.;[He, Tingting] Cent China Normal Univ, Dept Comp Sci, Wuhan, Peoples R China.

通讯机构： [Achananuparp, Palakorn] D;Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.

会议名称： 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining

会议时间： JUN 21-24, 2010

会议地点： Hyderabad, INDIA

会议主办单位： [Achananuparp, Palakorn;Hu, Xiaohua;Yang, Christopher C.;An, Yuan;Guo, Lifan] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.^[He, Tingting] Cent China Normal Univ, Dept Comp Sci, Wuhan, Peoples R China.

会议论文集名称： Lecture Notes in Artificial Intelligence

关键词： Answer diversification;answer reranking;random walk;negative-edge graph;complex question answering

摘要： We present a novel graph ranking model to extract a diverse set of answers for complex questions via random walks over a negative-edge graph. We assign a negative sign to edge weights in an answer graph to model the redundancy relation among the answer nodes. Negative edges can be thought of as the propagation of negative endorsements or disapprovals which is used to penalize factual redundancy. As the ranking proceeds, the initial score of the answer node, given by its relevancy to the specific question, will be adjusted according to a long-term negative endorsement from other answer nodes. We empirically evaluate the effectiveness of our method by conducting a comprehensive experiment on two distinct complex question answering data sets.

语种：英文

展开

导出

原文链接

认领

Obtaining Chinese semantic knowledge from online encyclopedia

作者： Yang, Liu;He, Tingting（何婷婷）;Tu, Xinhui;Chen, Jinguang

期刊： Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2010,2010年

通讯作者： Yang, L.(yangliu721@gmail.com)

作者机构： [He, Tingting; Yang, Liu] Department of Computer Science and Technology, Huazhong Normal University, Wuhan, Hubei, China;[Chen, Jinguang; Tu, Xinhui] Engineering and Research Center for Information Technology on Education, Huazhong Normal University, Wuhan, Hubei, China

通讯机构： Department of Computer Science and Technology, Huazhong Normal University, China

关键词： Concept;Encyclopedia;Semantic knowledge;Semantic relatedness

摘要： This paper proposes a method to obtain the semantic knowledge from an online encyclopedia called Hudong encyclopedia <sup>2</sup> (hudong baike). We obtain concepts and then their semantic related concepts and compute the semantic relatedness by utilizing inner hyperlinks and the open category information in Hudong encyclopedia. By comparing our results with human judgments, we show that our relatedness com puting method is quite effective. ©2010 IEEE.

语种：英文

展开

导出

原文链接

认领

汉语语义知识获取与语义计算模型研究

项目作者：何婷婷（何婷婷）

项目作者单位：华中师范大学

项目批准号： 90920005

资助经费： 50万

立项时间： 2010-01-01到2012-12-31

项目类别：重大研究计划

项目来源：国家自科基金项目

项目关键词：语义计算;知识获取;云模型;自然语言理解

项目摘要：研究适于句子与篇章的汉语语义计算模型，包括语义知识形式化表示方法、文本的语义表征模型、语义计算方法。研究内容和特色包括：①提出了一种显式表示概念的语义知识的形式化方法，把传统的基于语义知识库的方法与基于文档集统计分析的方法有机结合，取长补短，充分发挥各自的优势，又在一定程度上弥补各自的不足；②提出了用云模型来表征与计算不确定性概念的语义的方法，使得计算机能够在一定程度上理解不确定性概念的模糊性、随机性及二者间的关联性；③将这种语义知识平滑融入到现有的文本计算模型中建立文本的语义表征与计算模型，可显著提高计算机的语义理解能力；④提出与研究基于真实语言生活材料感知语义知识的策略，便于知识的动态更新、情境感知；⑤利用这种语义计算模型，研究影响自然语言处理的若干难点问题的解决方案，并通过网络文本信息检索的应用，验证研究成果的有效性。本研究对实现重大研究计划的总体目标有重要意义。

展开

导出

认领

A Probabilistic Topic-Connection model for automatic image annotation

作者： Chen, Xin;Hu, Xiaohua;Zhou, Zhongna;Lu, Caimei;Rosen, Gail;...

期刊： International Conference on Information and Knowledge Management, Proceedings,2010年:899-908

通讯作者： Chen, X.(bruce.chen@drexel.edu)

作者机构： [Lu, Caimei; Chen, Xin; Hu, Xiaohua] College of Information Science and Technology, Drexel University, Philadelphia, PA, United States;[Park, E.K.] CSI-CUNY, Staten Island, NY, United States;[He, Tingting] Dept. of Computer Science, Central China Normal University, Wuhan, China;[Rosen, Gail] Dept. of ECE, Drexel University, Philadelphia, PA, United States;[Zhou, Zhongna] Dept. of ECE, University of Missouri, Columbia, MO, United States

摘要： The explosive increase of image data on Internet has made it an important, yet very challenging task to index and automatically annotate image data. To achieve that end, sophisticated algorithms and models have been proposed to study the correlation between image content and corresponding text description. Despite the success of previous works, however, researchers are still facing two major difficulties that may undermine their effort of providing reliable and accurate annotations for images. The first difficulty is lacking of comprehensive benchmark image dataset with high quality text descriptions. The second difficulty is lacking of effective way to represent the image content and make it associate with the text descriptions. In our paper, we aim to deal with both problems. To deal with the first problem, we utilize Wikipedia as external knowledge source and enrich the ontology structure of ImageNet database with comprehensive and highly-reliable text descriptions from Wikipedia articles. To address the second problem, we develop a Probabilistic Topic-Connection (PTC) model to represent the connection between latent semantic topic in text description and latent patterns from image feature space. We compare the performance of our model with the currently popular Correspondence LDA (Corr-LDA) model under the same automatic image annotation scenario using cross-validation. Experimental results demonstrate that our model is able to well represent the connection between latent semantic topics and latent patterns in image feature space, thus facilitates knowledge organization and understanding of both image and text descriptions. ©2010 ACM.

语种：英文

展开

导出

原文链接

认领

Wikipedia-based semantic smoothing for the language modeling approach to information retrieval

作者： Tu, Xinhui*;He, Tingting（何婷婷）;Chen, Long;Luo, Jing;Zhang, Maoyuan

期刊： Lecture Notes in Computer Science,2010年5993:370-381 ISSN：0302-9743

通讯作者： Tu, Xinhui

作者机构： [He, Tingting; Tu, Xinhui; Zhang, Maoyuan] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan, Peoples R China.;[Chen, Long] Univ London Birkbeck Coll, London WC1E 7HU, England.;[Tu, Xinhui; Luo, Jing] Wuhan Univ Sci & Technol, Dept Comp Sci & Technol, Wuhan, Peoples R China.

通讯机构： [Tu, Xinhui] H;Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan, Peoples R China.

会议名称： 32nd European Conference on Information Retrieval Research

会议时间： MAR 28-31, 2010

会议地点： Milton Keynes, ENGLAND

会议主办单位： [Tu, Xinhui;He, Tingting;Zhang, Maoyuan] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan, Peoples R China.^[Chen, Long] Univ London Birkbeck Coll, London WC1E 7HU, England.^[Tu, Xinhui;Luo, Jing] Wuhan Univ Sci & Technol, Dept Comp Sci & Technol, Wuhan, Peoples R China.

会议论文集名称： Lecture Notes in Computer Science

关键词： Information Retrieval;Language Model;Wikipedia

摘要： Semantic smoothing for the language modeling approach to information retrieval is significant and effective to improve retrieval performance. In previous methods such as the translation model, individual terms or phrases are used to do semantic mapping. These models are not very efficient when faced with ambiguous words and phrases because they are unable to incorporate contextual information. To overcome this limitation, we propose a novel Wikipedia-based semantic smoothing method that decomposes a document into a set of weighted Wikipedia concepts and then maps those unambiguous Wikipedia concepts into query terms. The mapping probabilities from each Wikipedia concept to individual terms are estimated through the EM algorithm. Document models based on Wikipedia concept mapping are then derived. The new smoothing method is evaluated on the TREC Ad Hoc Track (Disks 1, 2, and 3) collections. Experiments show significant improvements over the two-stage language model, as well as the language model with translation-based semantic smoothing.

语种：英文

展开

导出

原文链接

认领

The Topic-Perspective Model for social tagging systems

作者： Lu, Caimei;Hu, Xiaohua;Chen, Xin;Park, Jung-Ran;He, Ting Ting（何婷婷）;...

期刊： Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2010年:683-691

通讯作者： Lu, C.(caimei.lu@drexel.edu)

作者机构： [Park, Jung-Ran; Chen, Xin; Lu, Caimei; Hu, Xiaohua] College of Information Science and Technology, Drexel University, Philadelphia, PA, United States;[Li, Zhoujun] School of Computer Science and Engineering, Beihang University, Beijing, China;[He, Ting Ting] Department of Computer Science, Central China Normal University, Wuhan, China

通讯机构： College of Information Science and Technology, Drexel University, United States

关键词： Perplexity;Social annotation;Social tagging;User modeling

摘要： In this paper, we propose a new probabilistic generative model, called Topic-Perspective Model, for simulating the generation process of social annotations. Different from other generative models, in our model, the tag generation process is separated from the content term generation process. While content terms are only generated from resource topics, social tags are generated by resource topics and user perspectives together. The proposed probabilistic model can produce more useful information than any other models proposed before. The parameters learned from this model include: (1) the topical distribution of each document, (2) the perspective distribution of each user, (3) the word distribution of each topic, (4) the tag distribution of each topic, (5) the tag distribution of each user perspective, (6) and the probabilistic of each tag being generated from resource topics or user perspectives. Experimental results show that the proposed model has better generalization performance or tag prediction ability than other two models proposed in previous research. ©2010 ACM.

语种：英文

展开

导出

原文链接

认领

极性相似度计算在词汇倾向性识别中的应用

作者：宋乐;何婷婷（何婷婷）;王倩;闻彬

期刊： 中文信息学报,2010年24(4):63-67 ISSN：1003-0077

作者机构： [何婷婷; 宋乐; 闻彬; 王倩] 华中师范大学,计算机科学与技术系,湖北,武汉,430079;[何婷婷; 宋乐; 闻彬; 王倩] 国家语言资源监测与研究中心,网络媒体分中心,湖北,武汉,430079

关键词：计算机应用;中文信息处理;极性义原;极性相似度;极性值

摘要：该文提出了一种新的基于HowNet相似度计算的词汇倾向性识别方法。该方法首先利用HowNet中的"良"、"莠"极性义原进行一种新的相似度——极性相似度的计算,再计算出词汇的极性值,进而识别出词汇的极性倾向。大量实验证明了该方法能够有效地区分词汇的极性,并且在第一届中文倾向性分析评测(COAE2008)比赛中取得了很好的效果。

语种：中文

展开

导出

原文链接

认领

1...5 678 9... 12 共 12 页

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

成果认领

提示

该栏目需要登录且有访问权限才可以访问