何婷婷 - 计算机学院 - 学者 - 华中师范大学智汇云

首页 > 学者 > 学者详情

何婷婷

博士

计算机学院

研究方向：从事网络媒体监测；自然语言处理；信息检索；数据库应用

个人成果详细资料

QQ 微信微博

认领成果

疑似成果

成果类型

请选择成果类型

全部

期刊论文

会议论文

专利

项目

筛选

开始检索

已无可筛选条件

成果类型

期刊论文

191

会议论文

项目

专利

年份 (1996~2021)

年

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

语种

英文

168

中文

期刊

Lecture Notes in Computer Science

计算机科学

2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

中文信息学报

BMC Bioinformatics

IEEE Transactions on NanoBioscience

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS

Methods

PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE

PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

计算机工程

2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2

2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

Frontiers in Genetics

IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING

Information-An International Interdisciplinary Journal

International Conference on Information and Knowledge Management, Proceedings

Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05)

作者

何婷婷

Tingting He

He, Tingting

He, T.

He, Tingting

Hu, Xiaohua

Jiang, Xingpeng

Xiaohua Hu

Hu, X.

Xingpeng Jiang

He, Tingting

Shen, X.

Yang, Jincai

He Tingting

Jiang, X.

Jiang, Xingpeng

Shen, Xianjun

Hu, Xiaohua

Xianjun Shen

Zhou, Guangyou

关键词

protein complexes

自然语言处理

词义消歧

Microbiome

中文信息处理

Information retrieval

Microbial interactions

Text mining

information retrieval

microbiome

自动文摘

计算机应用

Feature Selection

biomarkers

deep learning

microbiome

Algorithm CACE

Algorithms

Bioinformatics databases

Biological network

机构署名

本校为第一机构

193

本校为通讯机构

138

本校为第一且通讯机构

136

本校为其他机构

院系归属

计算机学院

193

国家数字化学习工程技术研究中心

信息管理学院

教育信息技术学院

生命科学学院

职业与继续教育学院

数学与统计学学院

教育学院

排序：

时间

3/12

每页显示条

请选择

共223条记录，

Prioritizing disease-causing microbes based on random walking on the heterogeneous network

作者： Shen, Xianjun*;Chen, Yao;Jiang, Xingpeng（蒋兴鹏）;Hu, Xiaohua;He, Tingting（何婷婷）;...

期刊： Methods,2017年124:120-125 ISSN：1046-2023

通讯作者： Shen, Xianjun

作者机构： [Jiang, Xingpeng; Yang, Jincai; He, Tingting; Shen, Xianjun; Hu, Xiaohua; Chen, Yao] Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.

通讯机构： [Shen, Xianjun] C;Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.

关键词： *Disease network;*Heterogeneous network;*Microbe network;*Random walk

摘要： As we all know, the microbiota show remarkable variability within individuals. At the same time, those microorganisms living in the human body play a very important role in our health and disease, so the identification of the relationships between microbes and diseases will contribute to better understanding of microbes interactions, mechanism of functions. However, the microbial data which are obtained through the related technical sequencing is too much, but the known associations between the diseases and microbes are very less. In bioinformatics, many researchers choose the network topology analysis to solve these problems. Inspired by this idea, we proposed a new method for prioritization of candidate microbes to predict potential disease-microbe association. First of all, we connected the disease network and microbe network based on the known disease-microbe relationships information to construct a heterogeneous network, then we extended the random walk to the heterogeneous network, and used leave-one-out cross-validation and ROC curve to evaluate the method. In conclusion, the algorithm could be effective to disclose some potential associations between diseases and microbes that cannot be found by microbe network or disease network only. Furthermore, we studied three representative diseases, Type 2 diabetes, Asthma and Psoriasis, and finally presented the potential microbes associated with these diseases by ranking candidate disease-causing microbes, respectively. We confirmed that the discovery of the new associations will be a good clinical solution for disease mechanism understanding, diagnosis and therapy.

语种：英文

展开

导出

原文链接

认领

Multi-View Clustering Microbiome Data by Joint Symmetric Nonnegative Matrix Factorization with Laplacian Regularization

作者： Ma, Yuanyuan;Hu, Xiaohua;He, Tingting（何婷婷）;Jiang, Xingpeng*（蒋兴鹏）

期刊： 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM),2017年:625-630 ISSN：2156-1125

通讯作者： Jiang, Xingpeng（蒋兴鹏）

作者机构： [Ma, Yuanyuan] Cent China Normal Univ, Sch Informat Management, Wuhan, Peoples R China.;[Jiang, Xingpeng; He, Tingting; Hu, Xiaohua] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Ma, Yuanyuan] Anyang Normal Univ, Anyang, Peoples R China.

通讯机构： [Jiang, Xingpeng] C;Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.

会议名称： IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM)

会议时间： DEC 15-18, 2016

会议地点： Shenzhen, PEOPLES R CHINA

会议主办单位： [Ma, Yuanyuan] Cent China Normal Univ, Sch Informat Management, Wuhan, Peoples R China.^[Hu, Xiaohua;He, Tingting;Jiang, Xingpeng] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.^[Ma, Yuanyuan] Anyang Normal Univ, Anyang, Peoples R China.

会议论文集名称： IEEE International Conference on Bioinformatics and Biomedicine-BIBM

关键词： Human Microbiome;Laplacian Regularization;Multi-view Clustering;Symmetric Nonnegative Matrix Factorization

摘要： Many datasets existed in the real world are often comprised of different representations or views which provide complementary information to each other. For example, microbiome datasets can be represented by metabolic paths, taxonomic assignment or gene families. To integrate information from multiple views, data integration approaches such as methods based on nonnegative matrix factorization (NMF) have been developed to combine multi-view information simultaneously to obtain a comprehensive view which reveals the underlying data structure shared by multiple views. In this paper, we proposed a novel variant of symmetric nonnegative matrix factorization (SNMF), called Laplacian regularized joint symmetric nonnegative matrix factorization (LJ-SNMF) for clustering multi-view data. We conduct extensive experiments on several realistic datasets including Human Microbiome Project (HMP) data. The experimental results show that the proposed method outperforms other variants of NMF, which suggests the potential application of LJ-SNMF in clustering multi-view datasets. © 2016 IEEE.

语种：英文

展开

导出

原文链接

认领

Discovery of Online Learning Collaboration Group Based on Two-Stage Clustering Algorithm

作者： Luo Changri;Zhang Xinhua*;He Tingting*（何婷婷）;Huang Baohua;Wu, Shaojing;...

期刊： PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC),2017年:2286-2290

通讯作者： Zhang Xinhua;He Tingting

作者机构： [Luo Changri; Xie, Yaohui] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Sch Vocat & Continuing Educ, Wuhan, Hubei, Peoples R China.;[Zhang Xinhua] Wuhan Vocat Coll Software & Engn, Sch Comp Sci, Wuhan, Hubei, Peoples R China.;[He Tingting] Cent China Normal Univ, Acad Comp Sci, Wuhan, Hubei, Peoples R China.;[Huang Baohua] Cent China Normal Univ, Sch Vocat & Continuing Educ, Wuhan, Hubei, Peoples R China.;[Wu, Shaojing] Cent China Normal Univ, Sch Informat Management, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China.

通讯机构： [Zhang Xinhua] W;[He Tingting] C;Wuhan Vocat Coll Software & Engn, Sch Comp Sci, Wuhan, Hubei, Peoples R China.;Cent China Normal Univ, Acad Comp Sci, Wuhan, Hubei, Peoples R China.

会议名称： 3rd IEEE International Conference on Computer and Communications (ICCC)

会议时间： DEC 13-16, 2017

会议地点： Chengdu, PEOPLES R CHINA

会议主办单位： [Luo Changri;Xie, Yaohui] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Sch Vocat & Continuing Educ, Wuhan, Hubei, Peoples R China.^[Zhang Xinhua] Wuhan Vocat Coll Software & Engn, Sch Comp Sci, Wuhan, Hubei, Peoples R China.^[He Tingting] Cent China Normal Univ, Acad Comp Sci, Wuhan, Hubei, Peoples R China.^[Huang Baohua] Cent China Normal Univ, Sch Vocat & Continuing Educ, Wuhan, Hubei, Peoples R China.^[Wu, Shaojing] Cent China Normal Univ, Sch Informat Management, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China.

关键词： two-stage clustering;online learning;learning collaboration group

摘要： Many studies have confirmed that the role of online learning collaboration group is very important. For large-scale online learning, how to effectively find the learning collaboration group is a difficult problem. The online learning forum is the main place for learners to learn and communicate, so it is the main venue for learning collaboration groups to implement collaborative learning. In the implementation process of learning collaboration, there are two characteristics between learning team members. First, there is interaction between them. Second, the contents of their discussion have high relevance. In this paper, the study uses these two important characteristics and carries out two-stage clustering algorithm based on the interaction structure and interactive contents of learners to find the potential learning collaboration groups in the large-scale online learning forum. The experimental results show that the method proposed in this paper is effective. This has practical significance for large-scale online learning support services.

语种：英文

展开

导出

原文链接

认领

二值矩阵分解的认知建模方法研究

作者：张猛;付丽华;何婷婷（何婷婷）;杨青

期刊： 计算机科学,2017年44(10):265-268 ISSN：1002-137X

作者机构：华中师范大学计算机学院, 教育信息化协同创新中心, 武汉, 430079;中国地质大学(武汉)数学与物理学院, 武汉, 430074;华中师范大学计算机学院, 武汉, 430079;[何婷婷; 杨青] 华中师范大学计算机学院, 武汉, 430079;[张猛] 华中师范大学计算机学院, 教育信息化协同创新中心, 武汉, 430079

关键词：认知建模;二值矩阵分解;考题分类;学生成绩预测

摘要：根据考试反馈数据,提出新颖的逻辑斯提克二值矩阵分解方法,来预测未来的学生考试成绩并自动对考题进行模式分类,同时设计新的算法对建模中遇到的非凸优化问题进行求解。在模拟数据和真实的美国SAT考试数据上进行的实验发现,新方法不仅可以准确地预测学生的考试表现,而且能够将考题按照知识点进行自动模式分类。实验结果表明,新的方法相比经典方法在结果的可解释性和估计精度方面有明显的提升。

语种：中文

展开

导出

原文链接

认领

The Modularity of Microbial Interaction Network in Healthy Human Saliva: Stability and Specificity

作者： Liu, Dan;Jiang, Xingpeng（蒋兴鹏）;Zheng, Huiru (Jane);Xie, Bo;Wang, Haiying;...

期刊： 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM),2017年2017-January:2048-2053 ISSN：2156-1125

通讯作者： Hu, Xiaohua

作者机构： [Jiang, Xingpeng; He, Tingting; Hu, Xiaohua; Liu, Dan] Cent China Normal Univ, Sch Comp, Wuhan, Hubei, Peoples R China.;[Zheng, Huiru (Jane); Wang, Haiying] Ulster Univ, Sch Comp & Math, Jordanstown, Antrim, North Ireland.;[Xie, Bo] Cent China Normal Univ, Sch Life Sci, Wuhan, Hubei, Peoples R China.

通讯机构： [Hu, Xiaohua] C;Cent China Normal Univ, Sch Comp, Wuhan, Hubei, Peoples R China.

会议名称： IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM)

会议时间： NOV 13-16, 2017

会议地点： Kansas City, MI

会议主办单位： [Liu, Dan;Jiang, Xingpeng;He, Tingting;Hu, Xiaohua] Cent China Normal Univ, Sch Comp, Wuhan, Hubei, Peoples R China.^[Zheng, Huiru (Jane);Wang, Haiying] Ulster Univ, Sch Comp & Math, Jordanstown, Antrim, North Ireland.^[Xie, Bo] Cent China Normal Univ, Sch Life Sci, Wuhan, Hubei, Peoples R China.

会议论文集名称： IEEE International Conference on Bioinformatics and Biomedicine-BIBM

关键词： biological network;microbes interactions;microbiome;modularity;spectral clustering

摘要： The human oral cavity is an important habitat of microbes in the human body. It includes the colonization of various microorganisms such as bacteria, archaea, fungi, protozoa and viruses. Although oral diseases have been studied for decades, we have limited understanding of the boundaries of a healthy oral ecosystem and ecological shift toward dysbiosis. Here, we analyzed salivary microbiomes from 268 healthy adults after overnight fasting. The microbiome data set is firstly divided into five sample clusters based on the similarity pattern of microbial abundance. For each cluster, the correlation networks among salivary bacteria are constructed based on an ensemble of six correlations and two dissimilarity measure. The stability and specificity of modularity in the five microbial networks are investigated. The existences of conserved and changing modules were found across five microbial correlation networks. © 2017 IEEE.

语种：英文

展开

导出

原文链接

认领

Bacterial Named Entity Recognition based on Dictionary and Conditional Random Field

作者： Wang, Xiaoyan;Jiang, Xingpeng（蒋兴鹏）;Liu, Mengwen;He, Tingting（何婷婷）;Hu, Xiaohua*

期刊： 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM),2017年2017-January:439-444 ISSN：2156-1125

通讯作者： Hu, Xiaohua

作者机构： [Jiang, Xingpeng; He, Tingting; Hu, Xiaohua; Wang, Xiaoyan] Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.;[Hu, Xiaohua; Liu, Mengwen] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [Hu, Xiaohua] C;[Hu, Xiaohua] D;Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.;Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

会议名称： IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM)

会议时间： NOV 13-16, 2017

会议地点： Kansas City, MI

会议主办单位： [Wang, Xiaoyan;Jiang, Xingpeng;He, Tingting;Hu, Xiaohua] Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.^[Liu, Mengwen;Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

会议论文集名称： IEEE International Conference on Bioinformatics and Biomedicine-BIBM

关键词： conditional random field;microbial interaction;named entity recognition;text mining

摘要： There are intensive computational efforts to discover large-scale microbial interactions from metagenomic abundance data, however, it is often difficult to validate such inferred interactions without a manually curated dataset. There are also a number of small-scale microbial interactions reported in massive literature with experimental confidence. Text mining can be employed to extract such microbial interactions from biomedical literature which could be a significant complement to abundance-based method. The key tasks of text mining include named entity recognition and relation extraction. Named entity recognition identifies the name of the specified type from the text. We manually annotated a corpus with 1344 abstracts from microbial literature for the task of bacterial named entity recognition. Six new features were added in addition to the general features of the biomedical field. Based on a bacterial dictionary and conditional random field (CRF), the bacterial named entity recognition model was trained and it achieved a performance with precision 89.118%, recall 81.598 % and F-measure 85.192%. The system and template are available at https://github.com/bluelilywxy/BacNER-V1.0.git. © 2017 IEEE.

语种：英文

展开

导出

原文链接

认领

Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization

作者： Zhou, Guangyou*;Xie, Zhiwen;He, Tingting（何婷婷）;Zhao, Jun;Hu, Xiaohua Tony

期刊： IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,2016年24(7):1305-1314 ISSN：2329-9290

通讯作者： Zhou, Guangyou

作者机构： [He, Tingting; Zhou, Guangyou; Zhao, Jun; Xie, Zhiwen; Hu, Xiaohua Tony] Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.;[Zhao, Jun] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China.;[Hu, Xiaohua Tony] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [Zhou, Guangyou] C;Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.

关键词： Community Question Answering;Information Retrieval;Natural Language Processing;Question Retrieval;Text Mining

摘要： Community question answering (CQA) has become an increasingly popular research topic. In this paper, we focus on the problem of question retrieval. Question retrieval in CQA can automatically find the most relevant and recent questions that have been solved by other users. However, the word ambiguity and word mismatch problems bring about new challenges for question retrieval in CQA. State-of-the-art approaches address these issues by implicitly expanding the queried questions with additional words or phrases using monolingual translation models. While useful, the effectiveness of these models is highly dependent on the availability of quality parallel monolingual corpora (e.g., question-answer pairs) in the absence of which they are troubled by noise issues. In this work, we propose an alternative way to address the word ambiguity and word mismatch problems by taking advantage of potentially rich semantic information drawn from other languages. Our proposed method employs statistical machine translation to improve question retrieval and enriches the question representation with the translated words from other languages via non-negative matrix factorization. Experiments conducted on real CQA data sets show that our proposed approach is promising.

语种：英文

展开

导出

原文链接

认领

Identification of the clustering structure in microbiome data by density clustering on the Manhattan distance

作者： Jiang, Xingpeng（蒋兴鹏）;Hu, Xiaohua;He, Tingting*（何婷婷）

期刊： 中国科学：信息科学(英文版),2016年59(7):070104-1-070104-7 ISSN：1674-733X

通讯作者： He, Tingting（何婷婷）

作者机构： [Jiang, Xingpeng; He, Tingting; Hu, Xiaohua] Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [He, Tingting] C;Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.

关键词： microbiome;information distance;data visualization;density clustering;microbial community

摘要： Clustering technology is a method for grouping data points into clusters containing a group of similar data points. In a real dataset such as microbiome data, the data points are presented as profiles or a probability distribution. These data points form the periphery of a cluster, making it difficult to identify the real clustering structure. In this study, we used density clustering on several distance measures to overcome this difficulty. Experiments using a real dataset indicated that the Manhattan distance is an appropriate distance measure for clustering analysis of microbiome data.

语种：英文

展开

导出

原文链接

认领

Exploiting Semantic Coherence Features for Information Retrieval

作者： Tu, Xinhui*;Huang, Jimmy Xiangji;Luo, Jing;He, Tingting（何婷婷）

作者机构： [He, Tingting; Huang, Jimmy Xiangji; Tu, Xinhui] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.;[Huang, Jimmy Xiangji] York Univ, Sch Informat Technol, Toronto, ON, Canada.;[Luo, Jing] Wuhan Univ Sci & Technol, Sch Comp Sci, Wuhan, Hubei, Peoples R China.

会议名称： 39th International ACM SIGIR conference on Research and Development in Information Retrieval

会议时间： JUL 17-21, 2016

会议地点： Pisa, ITALY

会议主办单位： [Tu, Xinhui;Huang, Jimmy Xiangji;He, Tingting] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.^[Huang, Jimmy Xiangji] York Univ, Sch Informat Technol, Toronto, ON, Canada.^[Luo, Jing] Wuhan Univ Sci & Technol, Sch Comp Sci, Wuhan, Hubei, Peoples R China.

关键词： Document ranking;Retrieval model;Term weighting

摘要： Most of the existing information retrieval models assume that the terms of a text document are independent of each other. These retrieval models integrate three major variables to determine the degree of importance of a term for a document: within document term frequency, document length and the specificity of the term in the collection. Intuitively, the importance of a term for a document is not only dependent on the three aspects mentioned above, but also dependent on the degree of semantic coherence between the term and the document. In this paper, we propose a heuristic approach, in which the degree of semantic coherence of the query terms with a document is adopted to improve the information retrieval performance. Experimental results on standard TREC collections show the proposed models consistently outperform the state-of-the-art models.

语种：英文

展开

导出

原文链接

认领

Hessian regularization based symmetric nonnegative matrix factorization for clustering gene expression and microbiome data

作者： Ma, Yuanyuan;Hu, Xiaohua;He, Tingting（何婷婷）;Jiang, Xingpeng*（蒋兴鹏）

期刊： Methods,2016年111:80-84 ISSN：1046-2023

通讯作者： Jiang, Xingpeng

作者机构： [Ma, Yuanyuan] Cent China Normal Univ, Sch Informat Management, Wuhan 430079, Peoples R China.;[Jiang, Xingpeng; He, Tingting; Hu, Xiaohua] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

通讯机构： [Jiang, Xingpeng] C;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

关键词： *Data clustering;*Hessian regularization;*Laplacian regularization;*Symmetric nonnegative matrix factorization

摘要： Nonnegative matrix factorization (NMF) has received considerable attention due to its interpretation of observed samples as combinations of different components, and has been successfully used as a clustering method. As an extension of NMF, Symmetric NMF (SNMF) inherits the advantages of NMF. Unlike NMF, however, SNMF takes a nonnegative similarity matrix as an input, and two lower rank nonnegative matrices (H, H-T) are computed as an output to approximate the original similarity matrix. Laplacian regularization has improved the clustering performance of NMF and SNMF. However, Laplacian regularization (LR), as a classic manifold regularization method, suffers some problems because of its weak extrapolating ability. In this paper, we propose a novel variant of SNMF, called Hessian regularization based symmetric nonnegative matrix factorization (HSNMF), for this purpose. In contrast to Laplacian regularization, Hessian regularization fits the data perfectly and extrapolates nicely to unseen data. We conduct extensive experiments on several datasets including text data, gene expression data and HMP (Human Microbiome Project) data. The results show that the proposed method outperforms other methods, which suggests the potential application of HSNMF in biological data clustering. (C) 2016 Published by Elsevier Inc.

语种：英文

展开

导出

原文链接

认领

Knowledge Base Question Answering Based on Deep Learning Models

作者： Xie, Zhiwen;Zeng, Zhao;Zhou, Guangyou*;He, Tingting（何婷婷）

期刊： Lecture Notes in Computer Science,2016年10102:300-311 ISSN：0302-9743

通讯作者： Zhou, Guangyou

作者机构： [He, Tingting; Zeng, Zhao; Zhou, Guangyou; Xie, Zhiwen] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

通讯机构： [Zhou, Guangyou] C;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

会议名称：第五届自然语言处理与中文计算会议(NLPCC-ICCPOL2016)

会议时间： 2016-12-02

会议地点：昆明

会议主办单位： Kunming Univ Sci & Technol

会议论文集名称：第五届自然语言处理与中文计算会议(NLPCC-ICCPOL2016)论文集

摘要： This paper focuses on the task of knowledge-based question answering (KBQA). KBQA aims to match the questions with the structured semantics in knowledge base. In this paper, we propose a two-stage method. Firstly, we propose a topic entity extraction model (TEEM) to extract topic entities in questions, which does not rely on hand-crafted features or linguistic tools. We extract topic entities in questions with the TEEM and then search the knowledge triples which are related to the topic entities from the knowledge base as the candidate knowledge triples. Then, we apply Deep Structured Semantic Models based on convolutional neural network and bidirectional long short-term memory to match questions and predicates in the candidate knowledge triples. To obtain better training dataset, we use an iterative approach to retrieve the knowledge triples from the knowledge base. The evaluation result shows that our system achieves an \(\text {Average} F_1\) measure of 79.57% on test dataset.

语种：英文

展开

导出

原文链接

认领

Cross-lingual sentiment classification with stacked autoencoders

作者： Zhou, Guangyou*;Zhu, Zhiyuan;He, Tingting（何婷婷）;Hu, Xiaohua Tony

期刊： Knowledge and Information Systems,2016年47(1):27-44 ISSN：0219-1377

通讯作者： Zhou, Guangyou

作者机构： [Zhou, Guangyou; Hu, Xiaohua Tony] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.;[He, Tingting] Cent China Normal Univ, Sch Comp, Nat Language Proc Lab, Wuhan 430079, Peoples R China.;[Zhu, Zhiyuan] Chinese Inst Elect, Beijing 100036, Peoples R China.;[Hu, Xiaohua Tony] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [Zhou, Guangyou] C;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

关键词： Sentiment classification;Cross-lingual;Stacked autoencoder

摘要： Cross-lingual sentiment classification is a popular research topic in natural language processing. The fundamental challenge of cross-lingual learning stems from a lack of overlap between the feature spaces of the source language data and the target language data. In this article, we propose a new model which uses stacked autoencoders to learn language-independent high-level feature representations for the both languages in an unsupervised fashion. The proposed framework aims to force the aligned input bilingual sentences into a common latent space, and the objective function is defined by minimizing the input and output vector representations as well as the distance of the common representations in the latent space. Sentiment classifiers trained on the source language can be adapted to predict sentiment polarity of the target language with the language-independent high-level feature representations. We conduct extensive experiments on English–Chinese sentiment classification tasks of multiple data sets. Our experimental results demonstrate the efficacy of the proposed cross-lingual approach. © 2015, Springer-Verlag London.

语种：英文

展开

导出

原文链接

认领

Learning semantic representation with neural networks for community question answering retrieval

作者： Zhou, Guangyou*;Zhou, Yin;He, Tingting（何婷婷）;Wu, Wensheng

期刊： Knowledge-Based Systems,2016年93(C):75-83 ISSN：0950-7051

通讯作者： Zhou, Guangyou

作者机构： [He, Tingting; Zhou, Guangyou; Zhou, Yin] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.;[Wu, Wensheng] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA.

通讯机构： [Zhou, Guangyou] C;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

关键词： Community question answering;Question retrieval;Text mining;Yahoo! Answers

摘要： Learning the semantic representation using neural network architecture.The neural network is trained via pre-training and fine-tuning phase.The learned semantic level feature is incorporated into a LTR framework. In community question answering (cQA), users pose queries (or questions) on portals like Yahoo! Answers which can then be answered by other users who are often knowledgeable on the subject. cQA is increasingly popular on the Web, due to its convenience and effectiveness in connecting users with queries and those with answers. In this article, we study the problem of finding previous queries (e.g., posed by other users) which may be similar to new queries, and adapting their answers as the answers to the new queries. A key challenge here is to the bridge the lexical gap between new queries and old answers. For example, "company" in the queries may correspond to "firm" in the answers. To address this challenge, past research has proposed techniques similar to machine translation that "translate" old answers to ones using the words in the new queries. However, a key limitation of these works is that they assume queries and answers are parallel texts, which is hardly true in reality. As a result, the translated or rephrased answers may not look intuitive.In this article, we propose a novel approach to learn the semantic representation of queries and answers by using a neural network architecture. The learned semantic level features are finally incorporated into a learning to rank framework. We have evaluated our approach using a large-scale data set. Results show that the approach can significantly outperform existing approaches. Learning the semantic representation using neural network architecture.The neural network is trained via pre-training and fine-tuning phase.The learned semantic level feature is incorporated into a LTR framework. In community question answering (cQA), users pose queries (or questions) on portals like Yahoo! Answers which can then be answered by other users who are often knowledgeable on the subject. cQA is increasingly popular on the Web, due to its convenience and effectiveness in connecting users with queries and those with answers. In this article, we study the problem of finding previous queries (e.g., posed by other users) which may be similar to new queries, and adapting their answers as the answers to the new queries. A key challenge here is to the bridge the lexical gap between new queries and old answers. For example, "company" in the queries may correspond to "firm" in the answers. To address this challenge, past research has proposed techniques similar to machine translation that "translate" old answers to ones using the words in the new queries. However, a key limitation of these works is that they assume queries and answers are parallel texts, which is hardly true in reality. As a result, the translated or rephrased answers may not look intuitive.In this article, we propose a novel approach to learn the semantic representation of queries and answers by using a neural network architecture. The learned semantic level features are finally incorporated into a learning to rank framework. We have evaluated our approach using a large-scale data set. Results show that the approach can significantly outperform existing approaches.

语种：英文

展开

导出

原文链接

认领

Neighbor affinity based algorithm for discovering temporal protein complex from dynamic PPI network

作者： Shen, Xianjun;Yi, Li;Jiang, Xingpeng（蒋兴鹏）;Zhao, Yanli;Hu, Xiaohua;...

期刊： Methods,2016年110:90-96 ISSN：1046-2023

通讯作者： Yang, Jincai（杨进才）

作者机构： [Jiang, Xingpeng; Yang, Jincai; He, Tingting; Shen, Xianjun; Hu, Xiaohua; Yi, Li; Zhao, Yanli] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [Yang, Jincai] C;Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.

关键词： *Clustering coefficient;*Neighbor affinity;*Temporal protein complex;*Time course protein interaction networks

摘要： Detection of temporal protein complexes would be a great aid in furthering our knowledge of the dynamic features and molecular mechanism in cell life activities. Most existing clustering algorithms for discovering protein complexes are based on static protein interaction networks in which the inherent dynamics are often overlooked. We propose a novel algorithm DPC-NADPIN (Discovering Protein Complexes based on Neighbor Affinity and Dynamic Protein Interaction Network) to identify temporal protein complexes from the time course protein interaction networks. Inspired by the idea of that the tighter a protein’s neighbors inside a module connect, the greater the possibility that the protein belongs to the module, DPC-NADPIN algorithm first chooses each of the proteins with high clustering coefficient and its neighbors to consolidate into an initial cluster, and then the initial cluster becomes a protein complex by appending its neighbor proteins according to the relationship between the affinity among neighbors inside the cluster and that outside the cluster. In our experiments, DPC-NADPIN algorithm is proved to be reasonable and it has better performance on discovering protein complexes than the following state-of-the-art algorithms: Hunter, MCODE, CFinder, SPICI, and ClusterONE; Meanwhile, it obtains many protein complexes with strong biological significance, which provide helpful biological knowledge to the related researchers. Moreover, we find that proteins are assembled coordinately to form protein complexes with characteristics of temporality and spatiality, thereby performing specific biological functions.

语种：英文

展开

导出

原文链接

认领

Construction of relational word dictionary and learning of relational rules in PPI extraction from biomedical literatures

作者： Guo, Xiyue*;He, Tingting（何婷婷）;Xing, Ying

期刊： INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS,2016年15(2):125-144 ISSN：1748-5673

通讯作者： Guo, Xiyue

作者机构： [Guo, Xiyue] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China.;[Guo, Xiyue] Xingyi Normal Univ Nationalities, Sch Informat Technol, Xingyi, Peoples R China.;[He, Tingting] Cent China Normal Univ, Sch Comp, Nat Language Proc Lab, Wuhan, Peoples R China.;[Xing, Ying] Zhongyuan Univ Technol, Software Coll, Zhengzhou, Peoples R China.

通讯机构： [Guo, Xiyue] C;[Guo, Xiyue] X;Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China.;Xingyi Normal Univ Nationalities, Sch Informat Technol, Xingyi, Peoples R China.

关键词： PPI extraction;weakly supervised;word dictionary construction;rule learning

摘要： Each method, machine learning-based and rule-based, for extracting PPI (Protein-Protein Interactions) from biomedical literatures has advantages and disadvantages. In order to utilise the superiorities of these methods reasonably, this paper designs a new structure for the relational word dictionary, uses weakly supervised method to find dictionary items and fill them into the PPI relational word dictionary, and presents a method to learn PPI relational rules automatically based on slot-filling principle. Moreover, this method takes the PPI relation instances without apparent relational words into consideration aiming to improve the final performance. We conduct the experiments with five authoritative biomedical PPI corpuses, and discover some distribution features about PPI relational words. Finally, we also compare our method with several recent research achievements, and the results show that the performance of our method is better than the average level among these methods.

语种：英文

展开

导出

原文链接

认领

Realizing secret sharing with general access structure

作者： Harn, Lein;Hsu, Chingfang*;Zhang, Mingwu;He, Tingting（何婷婷）;Zhang, Maoyuan

期刊： Information Sciences,2016年367-368:209-220 ISSN：0020-0255

通讯作者： Hsu, Chingfang

作者机构： [Harn, Lein; Zhang, Mingwu] Hubei Univ Technol, Sch Comp Sci & Technol, Wuhan 430068, Peoples R China.;[Harn, Lein] Univ Missouri, Dept Comp Sci Elect Engn, Kansas City, MO 64110 USA.;[He, Tingting; Zhang, Maoyuan; Hsu, Chingfang] Cent China Normal Univ, Comp Sch, Wuhan 430079, Peoples R China.

通讯机构： [Hsu, Chingfang] C;Cent China Normal Univ, Comp Sch, Wuhan 430079, Peoples R China.

关键词： General secret sharing;Chinese remainder theorem;Secret sharing policy;Monotone function;Integer optimization;Minimal positive access subset;Maximal negative access subset

摘要： Secret sharing (SS) is one of the most important cryptographic primitives used for data outsourcing. The (t, n,) SS was introduced by Shamir and Blakley separately in 1979. The secret sharing policy of the (t, n) threshold SS is far too simple for many applications because it assumes that every shareholder has equal privilege to the secret or every shareholder is equally trusted. Ito et al. introduced the concept of a general secret sharing scheme (GSS). In a GSS, a secret is divided among a set of shareholders in such a way that any "qualified" subset of shareholders can access the secret, but any "unqualified" subset of shareholders cannot access the secret. The secret access structure of GSS is far more flexible than threshold SS. In this paper, we propose an optimized implementation of GSS. Our proposed scheme first uses Boolean logic to derive two important subsets, one is called Min which is the minimal positive access subset and the other is called Max which is the maximal negative access subset, of a given general secret sharing structure. Then, conditions of parameters of a GSS are established based on these two important subsets. Furthermore, integer linear/non-linear programming is used to optimize the size of shares of a GSS. The complexity of linear/non-linear programming is O(n), where n is the number of shares generated by the dealer. This proposed design can be applied to implement GSS based on any classical SS. However, our proposed method is limited to be applicable to some general secret sharing policies. We use two GSSs, one is based on Shamir's weighted SS (WSS) using linear polynomial and the other is based on Asmuth-Bloom's SS using Chinese Remainder Theorem (CRT), to demonstrate our design. In comparing with existing GSSs, our proposed scheme is more efficient and can be applied to all classical SSs. (C) 2016 Elsevier Inc. All rights reserved.

语种：英文

展开

导出

原文链接

认领

A novel proteins complex identification based on connected affinity and multi-level seed extension

作者： He, Tingting*（何婷婷）;Li, Peng;Hu, Xiaohua;Shen, Xianjun;Wang, Yan;...

期刊： INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS,2016年14(1):51-70 ISSN：1748-5673

通讯作者： He, Tingting（何婷婷）

作者机构： [He, Tingting; Shen, Xianjun; Hu, Xiaohua; Li, Peng; Zhao, Junmin; Wang, Yan] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA.

通讯机构： [He, Tingting] C;Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.

关键词： complex networks;connected affinity model;multi-level seed extension model;algorithm CAMSE

摘要： The identification of modules in complex networks is important for the understanding of biological systems. Recent studies have shown those modules can be identified from the protein interaction network, what's more, the modules has not only relatively high density, but also has high coefficient of affinity. In this paper, we propose a novel algorithm based on Connected Affinity and Multi-level Seed Extension (CAMSE). First, CAMSE integrates Protein Interactions (PPI) with the protein Connected Coefficient (CC) inferred from protein complexes collected in the MIPS database to enhance the modularisation and biological character. Then we complete the seed selection, inner kernel extensions and outer extension to get core candidate function modules step by step. Finally, we integrated the modules with high repeat rate. The experimental results show that CAMSE can detect the functional modules much more effectively and accurately when it compared with other state-of-art algorithms CPM, CACE and IPC-MCE.

语种：英文

展开

导出

原文链接

认领

Leveraging Chinese encyclopedia for weakly supervised relation extraction

作者： Guo, Xiyue;He, Tingting（何婷婷）

期刊： Lecture Notes in Computer Science,2016年9544:127-140 ISSN：0302-9743

通讯作者： He, Tingting(tthe@mail.ccnu.edu.cn)

作者机构： [Guo, Xiyue] National Engineering Research Center for E-learning, Central China Normal University, Wuhan, China;[Guo, Xiyue] School of Information Technology, Xingyi Normal University for Nationalities, Xingyi, China;[He, Tingting] School of Computer, Central China Normal University, Wuhan, China

摘要： In the research of named-entity relation extraction based on supervision, selecting relation features for traditional methods are usually finished by people, and it’s hard to implement these methods for large-scale corpus. On the other hand, fixing relation types is the premise, so the practicabilities of these methods are not so ideal. This paper presents a weakly supervised method for Chinese named-entity relation extraction without man-made annotations, and the relation types in this method are not chosen artificially. The method collects entity relation types from the structured knowledge in encyclopedia pages, and then automatically annotates the relation instances existing in the texts based on these relation types. Simultaneously, the syntactic and semantic features of entity relations will be considered in this method, then the machine learning data will be completed, finally we use Support Vector Machine (SVM) model to train relation classifiers from training data, and these classifiers could try to extract entity relations from testing data. We carry out the experiment with the data from Chinese Baidu Encyclopedia pages, and the results show the effectiveness of this method, the overall F1 value reaches to 83.12%. In order to probe the universality of this method, we also use the acquired relation classifiers to extract entity relations from news texts, and the results manifest that this method owns certain universality. ©Springer International Publishing Switzerland 2016.

语种：英文

展开

导出

原文链接

认领

A Simple Enhancement for Ad-hoc Information Retrieval via Topic Modelling

作者： Jian, Fanghong;Huang, Jimmy Xiangji*;Zhao, Jiashu;He, Tingting（何婷婷）;Hu, Po

期刊： SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL,2016年:733-736

通讯作者： Huang, Jimmy Xiangji

作者机构： [Jian, Fanghong] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China.;[Huang, Jimmy Xiangji] Cent China Normal Univ, Informat Retrieval & Knowledge Management Res Lab, Wuhan, Hubei, Peoples R China.;[He, Tingting; Hu, Po] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.;[Zhao, Jiashu] York Univ, Sch Informat Technol, Toronto, ON, Canada.

通讯机构： [Huang, Jimmy Xiangji] C;Cent China Normal Univ, Informat Retrieval & Knowledge Management Res Lab, Wuhan, Hubei, Peoples R China.

会议名称： 39th International ACM SIGIR conference on Research and Development in Information Retrieval

会议时间： JUL 17-21, 2016

会议地点： Pisa, ITALY

会议主办单位： [Huang, Jimmy Xiangji] Cent China Normal Univ, Informat Retrieval & Knowledge Management Res Lab, Wuhan, Hubei, Peoples R China.^[Jian, Fanghong] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China.^[He, Tingting;Hu, Po] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.^[Zhao, Jiashu] York Univ, Sch Informat Technol, Toronto, ON, Canada.

关键词： Probabilistic Model;Dirichlet Language Model;LDA

摘要： Traditional information retrieval (IR) models, in which a document is normally represented as a bag of words and their frequencies, capture the term-level and document-level information. Topic models, on the other hand, discover semantic topic-based information among words. In this paper, we consider term-based information and semantic information as two features of query terms and propose a simple enhancement for ad-hoc IR via topic modeling. In particular, three topic-based hybrid models, LDA-BM25, LDA-MATF and LDA-LM, are proposed. A series of experiments on eight standard datasets show that our proposed models can always outperform significantly the corresponding strong baselines over all datasets in terms of MAP and most of datasets in terms of P@5 and P@20. A direct comparison on eight standard datasets also indicates our proposed models are at least comparable to the state-of-the-art approaches.

语种：英文

展开

导出

原文链接

认领

Mining Temporal Protein Complex Based on the Dynamic PIN Weighted with Connected Affinity and Gene Co-Expression

作者： Shen, Xianjun;Yi, Li;Jiang, Xingpeng（蒋兴鹏）;He, Tingting（何婷婷）;Hu, Xiaohua;...

期刊： PLOS ONE,2016年11(4):e0153967 ISSN：1932-6203

通讯作者： Yang, Jincai

作者机构： [Jiang, Xingpeng; Yang, Jincai; He, Tingting; Shen, Xianjun; Hu, Xiaohua; Yi, Li] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.

通讯机构： [Yang, Jincai] C;Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.

关键词： Protein complexes;Protein interactions;Protein interaction networks;Gene expression;Algorithms;Molecular evolution;Protein expression;Protein metabolism

摘要： The identification of temporal protein complexes would make great contribution to our knowledge of the dynamic organization characteristics in protein interaction networks (PINs). Recent studies have focused on integrating gene expression data into static PIN to construct dynamic PIN which reveals the dynamic evolutionary procedure of protein interactions, but they fail in practice for recognizing the active time points of proteins with low or high expression levels. We construct a Time-Evolving PIN (TEPIN) with a novel method called Deviation Degree, which is designed to identify the active time points of proteins based on the deviation degree of their own expression values. Owing to the differences between protein interactions, moreover, we weight TEPIN with connected affinity and gene co-expression to quantify the degree of these interactions. To validate the efficiencies of our methods, ClusterONE, CAMSE and MCL algorithms are applied on the TEPIN, DPIN (a dynamic PIN constructed with state-of-the-art three-sigma method) and SPIN (the original static PIN) to detect temporal protein complexes. Each algorithm on our TEPIN outperforms that on other networks in terms of match degree, sensitivity, specificity, F-measure and function enrichment etc. In conclusion, our Deviation Degree method successfully eliminates the disadvantages which exist in the previous state-of-the-art dynamic PIN construction methods. Moreover, the biological nature of protein interactions can be well described in our weighted network. Weighted TEPIN is a useful approach for detecting temporal protein complexes and revealing the dynamic protein assembly process for cellular organization.

语种：英文

展开

导出

原文链接

认领

1 234 5 6 7... 12 共 12 页

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

成果认领

提示

该栏目需要登录且有访问权限才可以访问