期刊:
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS,2024年PP:1-12 ISSN:2168-2194
作者机构:
[Xueli Pan; Frank van Harmelen] Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China;National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, China;National Language Resources Monitor Research Center for Network Media, Central China Normal University, Wuhan, China;School of Computer Science, Central China Normal University, Wuhan, China
摘要:
It is commonly known that food nutrition is closely related to human health. The complex interactions between food nutrients and diseases, influenced by gut microbial metabolism, present challenges in systematizing and practically applying knowledge. To address this, we propose a method for extracting triples from a vast amount of literature, which is used to construct a comprehensive knowledge graph on nutrition and human health. Concurrently, we develop a query-based question answering system over our knowledge graph, proficiently addressing three types of questions. The results show that our proposed model outperforms other state-of-art methods, achieving a precision of 0.92, a recall of 0.81, and an F1 score of 0.86in the nutrition and disease relation extraction task. Meanwhile, our question answering system achieves an accuracy of 0.68 and an F1 score of 0.61 on our benchmark dataset, showcasing competitiveness in practical scenarios. Furthermore, we design five independent experiments to assess the quality of the data structure in the knowledge graph, ensuring results characterized by high accuracy and interpretability. In conclusion, the construction of our knowledge graph shows significant promise in facilitating diet recommendations, enhancing patient care applications, and informing decision-making in clinical research.
期刊:
IEEE Transactions on Neural Networks and Learning Systems,2024年PP ISSN:2162-237X
通讯作者:
He, TT
作者机构:
[Fan, Rui] Cent China Normal Univ, Fac Artificial Intelligence Educ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.;[Fan, Rui; Dong, Ming; Tu, Xinhui; Zhang, Mengyuan; He, Tingting; Chen, Menghan] Cent China Normal Univ, Natl Language Resources Monitor & Res Ctr Network, Wuhan 430079, Peoples R China.;[Dong, Ming; Tu, Xinhui; Zhang, Mengyuan; He, Tingting; Chen, Menghan] Cent China Normal Univ, Sch Comp, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.
通讯机构:
[He, TT ] C;Cent China Normal Univ, Natl Language Resources Monitor & Res Ctr Network, Wuhan 430079, Peoples R China.;Cent China Normal Univ, Sch Comp, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.
摘要:
Multimodal aspect-based sentiment classification (MABSC) aims to identify the sentiment polarity toward specific aspects in multimodal data. It has gained significant attention with the increasing use of social media platforms. Existing approaches primarily focus on analyzing the content of posts to predict sentiment. However, they often struggle with limited contextual information inherent in social media posts, hindering accurate sentiment detection. To overcome this issue, we propose a novel multimodal dual cause analysis (MDCA) method to track the underlying causes behind expressed sentiments. MDCA can provide additional reasoning cause (RC) and direct cause (DC) to explain why users express certain emotions, thus helping improve the accuracy of sentiment prediction. To develop a model with MDCA, we construct MABSC datasets with RC and DC by utilizing large language models (LLMs) and visual-language models. Subsequently, we devise a multitask learning framework that leverages the datasets with cause data to train a small generative model, which can generate RC and DC, and predict the sentiment assisted by these causes. Experimental results on MABSC benchmark datasets demonstrate that our MDCA model achieves the state-of-the-art performance, and the small fine-tuned model exhibits superior adaptability to MABSC compared to large models like ChatGPT and BLIP-2.
摘要:
Dialogue state tracking (DST) is a core component of task-oriented dialogue systems. Recent works focus mainly on end-to-end DST models that omit the spoken language understanding (SLU) module to directly obtain the dialogue state based on a user’s dialogue. However, the slot information detected by slot filling in SLU is closely tied to the slot–value pair that needs to be updated in DST. Efficient use of the key slot semantic knowledge obtained by slot filling contributes to improving the performance of DST. Based on this idea, we introduce slot filling as a subtask and build an end-to-end joint model to explicitly integrate the slot information detected by slot filling, which further guides DST. In this article, a novel stack-propagation framework with slot filling for multidomain DST is proposed. The stack-propagation framework is introduced to jointly model slot filling and DST. The framework directly feeds the key slot semantic knowledge detected by slot filling into the DST module. In addition, a slot-masked attention mechanism is designed to enable DST to focus on the key slot information obtained by slot filling. When the slot value is updated, a slot–value softcopy mechanism is designed to enhance the influence of the words marked by key slots. Experiments show that our approach outperforms previous methods and performs outstandingly on two benchmark datasets. IEEE
作者机构:
[Zhang, Miao] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.;[Zhang, Miao; He, Tingting; Dong, Ming; Dong, M] Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.;[Zhang, Miao; He, Tingting; Dong, Ming; Dong, M] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China.;[He, Tingting; Dong, Ming; Dong, M] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.
通讯机构:
[He, TT; Dong, M ] C;Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.;Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Peoples R China.;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.
摘要:
Commonsense question answering (CQA) requires understanding and reasoning over QA context and related commonsense knowledge, such as a structured Knowledge Graph (KG). Existing studies combine language models and graph neural networks to model inference. However, traditional knowledge graph are mostly concept-based, ignoring direct path evidence necessary for accurate reasoning. In this paper, we propose MRGNN (Meta-path Reasoning Graph Neural Network), a novel model that comprehensively captures sequential semantic information from concepts and paths. In MRGNN, meta-paths are introduced as direct inference evidence and an original graph neural network is adopted to aggregate features from both concepts and paths simultaneously. We conduct sufficient experiments on the CommonsenceQA and OpenBookQA datasets, showing the effectiveness of MRGNN. Also, we conduct further ablation experiments and explain the reasoning behavior through the case study.
作者机构:
[Jiang, Xingpeng; Zhao, Weizhong; He, Tingting; Zhao, WZ; Wu, Junze] Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Hubei, Peoples R China.;[Zhao, Weizhong; Zhao, WZ] Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.;[Zhao, Weizhong] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Hubei, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.
通讯机构:
[Jiang, XP ; Zhao, WZ ; Zhao, WZ] C;Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Hubei, Peoples R China.;Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.
摘要:
<jats:title>Abstract</jats:title>
<jats:sec>
<jats:title>Motivation</jats:title>
<jats:p>The crisis of antibiotic resistance, which causes antibiotics used to treat bacterial infections to become less effective, has emerged as one of the foremost challenges to public health. Identifying the properties of antibiotic resistance genes (ARGs) is an essential way to mitigate this issue. Although numerous methods have been proposed for this task, most of these approaches concentrate solely on predicting antibiotic class, disregarding other important properties of ARGs. In addition, existing methods for simultaneously predicting multiple properties of ARGs fail to account for the causal relationships among these properties, limiting the predictive performance.</jats:p>
</jats:sec>
<jats:sec>
<jats:title>Results</jats:title>
<jats:p>In this study, we propose a causality-guided framework for annotating properties of ARGs, in which causal inference is utilized for representation learning. More specifically, the hidden biological patterns determining the properties of ARGs are described by a Gaussian Mixture Model, and procedure of causal representation learning is used to derive the hidden features. In addition, a causal graph among different properties is constructed to capture the causal relationships among properties of ARGs, which is integrated into the task of annotating properties of ARGs. The experimental results on a real-world dataset demonstrate the effectiveness of the proposed framework on the task of annotating properties of ARGs.</jats:p>
</jats:sec>
<jats:sec>
<jats:title>Availability and implementation</jats:title>
<jats:p>The data and source codes are available in GitHub at https://github.com/David-WZhao/CausalARG.</jats:p>
</jats:sec>
期刊:
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS,2024年28(7):4348-4360 ISSN:2168-2194
通讯作者:
Zhao, WZ
作者机构:
[Zhao, Weizhong; Zhao, WZ; He, Tingting; Jiang, Xingpeng; Wu, Junze; Luo, Shujie] Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.;[Zhao, Weizhong; Zhao, WZ; He, Tingting; Jiang, Xingpeng; Wu, Junze; Luo, Shujie] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.;[Zhao, Weizhong; Zhao, WZ; He, Tingting; Jiang, Xingpeng; Wu, Junze; Luo, Shujie] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr netwo, Wuhan 430079, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.
通讯机构:
[Zhao, WZ ] C;Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R China.;Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.;Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr netwo, Wuhan 430079, Peoples R China.
摘要:
The crisis of antibiotic resistance has become a significant global threat to human health. Understanding properties of antibiotic resistance genes (ARGs) is the first step to mitigate this issue. Although many methods have been proposed for predicting properties of ARGs, most of these methods focus only on predicting antibiotic classes, while ignoring other properties of ARGs, such as resistance mechanisms and transferability. However, acquiring all of these properties of ARGs can help researchers gain a more comprehensive understanding of the essence of antibiotic resistance, which will facilitate the development of antibiotics. In this paper, the task of predicting properties of ARGs is modeled as a multi-task learning problem, and an effective subtask-aware representation learning-based framework is proposed accordingly. More specifically, property-specific expert networks and shared expert networks are utilized respectively to learn subtask-specific features for each subtask and shared features among different subtasks. In addition, a gating-controlled mechanism is employed to dynamically allocate weights to subtask-specific semantics and shared semantics obtained respectively from property-specific expert networks and shared expert networks, thus adjusting distinctive contributions of subtask-specific features and shared features to achieve optimal performance for each subtask simultaneously. Extensive experiments are conducted on publicly available data, and experimental results demonstrate the effectiveness of the proposed framework on the task of ARGs properties prediction.
作者机构:
Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, Hubei, PR China;School of Computer, Central China Normal University, Wuhan, Hubei, PR China;National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan, Hubei, PR China;[Xiaowei Xu] Department of Information Science, University of Arkansas at Little Rock, Little Rock, AR, USA;[Shengwei Ye; Weizhong Zhao; Xianjun Shen; Xingpeng Jiang; Tingting He] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, Hubei, PR China<&wdkj&>School of Computer, Central China Normal University, Wuhan, Hubei, PR China<&wdkj&>National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan, Hubei, PR China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
关键词:
drug repositioning;drug-disease associations prediction;counterfactual links;graph convolutional network;heterogeneous information network
摘要:
Drug repositioning is the process of identifying potential associations between approved drugs and diseases (DDAs) to unveil novel therapeutic applications. Unlike traditional drug discovery approaches, a key advantage of drug repositioning lies in its capacity to leverage the existing knowledge and safety profiles of established medications, leading to significant reductions in both the time and costs associated with drug development. While various methods have been proposed to address this challenge using diverse strategies, the conventional approach for training DDAs prediction models typically relies on random sampling of unknown drug-disease pairs to construct negative samples. However, this method may inadvertently introduce unwanted noise or errors by erroneously categorizing some genuine DDAs as negative samples, thereby leaving room for improvement in current methodologies. In this paper, we introduce a novel negative sample selection algorithm for DDAs prediction that explicitly incorporates causal knowledge inherent in DDAs. To accomplish this, we first construct a heterogeneous information network (HIN) that encompasses various biological entities associated with DDAs and their interconnections. Subsequently, we utilize the outcomes of community detection within the HIN as a form of counterfactual inference, resulting in the development of a negative sample selection algorithm based on a thoughtfully designed counterfactual question. By combining the known DDAs (i.e., positive samples) with the newly generated negative samples, we train a prediction model that incorporates a graph learning module to acquire representations of drugs and diseases. Comprehensive experiments confirm the effectiveness of our proposed model for DDAs prediction.
作者机构:
Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China;School of Computer, Central China Normal University, Wuhan, China;National Language Resources Monitoring and Research Center for Network Media, Central China Normal University, Wuhan, China;[Xiaowei Xu] Department of Information Science, University of Arkansas at Little Rock, Little Rock, AR, USA;[Wenjie Yao] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China<&wdkj&>School of Computer, Central China Normal University, Wuhan, China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
关键词:
counterfactual inference;data augmentation;drug-side effect associations prediction;heterogeneous information network
摘要:
Detecting drug side effects is crucial in development of drugs. As publicly available biomedical data expands, researchers have devised numerous computational methods for predicting drug-side effect associations (DSAs). Among these, network-based approaches have gained significant attention in the biomedical field. However, the challenge of data scarcity poses a significant hurdle for existing DSAs prediction models. While various data augmentation methods have been created to solve the proble, most rely on random alterations to the original networks, neglecting the causality of DSAs’ existence, thus impacting the predictive performance negatively. In this paper, we introduce a counterfactual inference-based data augmentation method to enhance performance. First,a heterogeneous information network (HIN) is construct by integrating multiple biomedical data sources. We employ community detection on the HIN to preform a counterfactual inference-based method, deriving augmented links and an augmented HIN. Subsequently, we apply a meta-path-based graph neural network to obtain high-quality representations of drugs and side effects, enabling the prediction of DSAs. Our comprehensive experiments confirm the effectiveness of this counterfactual inference-based data augmentation for DSAs prediction.
作者:
Yi Jia;Shanshan Zheng;Tingting He;Xingpeng Jiang
作者机构:
School of Mathematics and Statistics, Central China Normal University, WuHan, PR China;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China;School of Computer, Central China Normal University, WuHan, PR China;National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, WuHan, PR China;[Yi Jia] School of Mathematics and Statistics, Central China Normal University, WuHan, PR China<&wdkj&>Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China<&wdkj&>School of Computer, Central China Normal University, WuHan, PR China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
摘要:
Despite profound impacts on human health and nature, accurately predicting microbe-metabolite interactions remains challenging due to inherent data noise. This study applies non-negative matrix factorization (NMF) and multi-view NMF to reduce noise and exploit associations across data perspectives. NMF obtains low-dimensional microbial and metabolic representations, effectively reducing noise. The dimension-reduced spectral matrices were input into the generative network model to derive conditional probabilities of individual microbe-associated metabolites and microbe-metabolite co-occurrence probabilities, the latter enabling prediction of microbe-metabolite interactions. Moreover, multi-view NMF integrates microbial and metabolic data by mapping them into a shared subspace, thereby enhancing prediction performance and validating cross-perspective correlation modeling. This study demonstrates NMF's efficacy in noise reduction through dimensionality reduction, and multiview NMF's ability to leverage cross-view associations. Both approaches demonstrate enhanced microbe-metabolite interaction prediction utilizing NMF-based and multi-view NMF-based generative network models.
作者机构:
School of Computer, Central China Normal University, WuHan, PR China;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China;National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, WuHan, PR China;[Yanan Yao; Tian Yu; Huanghan Zhan; Weizhong Zhao; Tingting He; Xingpeng Jiang] School of Computer, Central China Normal University, WuHan, PR China<&wdkj&>Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China<&wdkj&>National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, WuHan, PR China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
摘要:
Antibiotic resistance event extraction involves the automated extraction of information related to antibiotic resistance mechanisms from a vast amount of biomedical literature. This can be achieved by utilizing natural language processing techniques. However, the distinctive characteristics of the biomedical field lead to various challenges for existing antibiotic resistance event extraction methods, such as limited labeling data, complex names of biomedical entities, and nesting and overlapping event structures. These factors make it challenging to apply the current processing methods for biomedical text to the task of antibiotic resistance event extraction. To address these challenges, we propose a cascade decoding approach for antibiotic resistance event extraction based on contrastive learning (CL-MA-CasEE). This approach achieves data augmentation by constructing two contrastive learning tasks, which combines entity type embedding and POS embedding to enrich the semantic information of word representations. Furthermore, it performs event type detection, event trigger extraction, and event argument extraction through using three cascade decoders to simulate the complex event structures. Based on experiments, we demonstrate that our method can effectively extract structured antibiotic resistance event information from biomedical literature, thereby improve the efficiency of event extraction tasks as well.
期刊:
BRIEFINGS IN BIOINFORMATICS,2023年24(2) ISSN:1467-5463
通讯作者:
Weizhong Zhao
作者机构:
[Shen, Xianjun; Zhao, Weizhong; Yuan, Xueling; He, Tingting; Jiang, Xingpeng] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Shi, Chuan] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing, Peoples R China.;[Hu, Xiaohua] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA.
通讯机构:
[Weizhong Zhao] H;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University , Wuhan, Hubei 430079, P R China<&wdkj&>School of Computer Science, Beijing University of Posts and Telecommunications , Beijing, 100876, P R China<&wdkj&>National Language Resources Monitoring & Research Center for Network Media, Central China Normal University , Wuhan, Hubei 430079, P R China
关键词:
drug–drug interaction;heterogeneous information network;meta-path-based information fusion
摘要:
<jats:title>Abstract</jats:title><jats:p>Drug–drug interactions (DDIs) are compound effects when patients take two or more drugs at the same time, which may weaken the efficacy of drugs or cause unexpected side effects. Thus, accurately predicting DDIs is of great significance for the drug development and the drug safety surveillance. Although many methods have been proposed for the task, the biological knowledge related to DDIs is not fully utilized and the complex semantics among drug-related biological entities are not effectively captured in existing methods, leading to suboptimal performance. Moreover, the lack of interpretability for the predicted results also limits the wide application of existing methods for DDIs prediction. In this study, we propose a novel framework for predicting DDIs with interpretability. Specifically, we construct a heterogeneous information network (HIN) by explicitly utilizing the biological knowledge related to the procedure of inducing DDIs. To capture the complex semantics in HIN, a meta-path-based information fusion mechanism is proposed to learn high-quality representations of drugs. In addition, an attention mechanism is designed to combine semantic information obtained from meta-paths with different lengths to obtain final representations of drugs for DDIs prediction. Comprehensive experiments are conducted on 2410 approved drugs, and the results of predictive performance comparison show that our proposed framework outperforms selected representative baselines on the task of DDIs prediction. The results of ablation study and cold-start scenario indicate that the meta-path-based information fusion mechanism red is beneficial for capturing the complex semantics among drug-related biological entities. Moreover, the results of case study demonstrate that the designed attention mechanism is able to provide partial interpretability for the predicted DDIs. Therefore, the proposed method will be a feasible solution to the task of predicting DDIs.</jats:p>
期刊:
Artificial Intelligence in Medicine,2023年145:102677 ISSN:0933-3657
通讯作者:
Jiang, XP
作者机构:
[Fu, Chengcheng] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China.;[Jiang, Xingpeng; Fu, Chengcheng; He, Tingting] Cent China Normal Univ, Sch Comp Sci, Wuhan, Peoples R China.;[Fu, Chengcheng; van Harmelen, Frank; Huang, Zhisheng] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands.;[Fu, Chengcheng; He, Tingting; Jiang, Xingpeng] Cent China Normal Univ, Natl Language Resources Monitor Res Ctr Network Me, Wuhan, Peoples R China.;[Huang, Zhisheng] Tongji Univ, Sch Med, Clin Res Ctr Mental Disorders, Shanghai Pudong New Area Mental Hlth Ctr, Shanghai, Peoples R China.
通讯机构:
[Jiang, XP ] C;Cent China Normal Univ, Sch Comp Sci, Wuhan, Peoples R China.
关键词:
Food;Gut microbiota;Knowledge graph;Mental health
作者机构:
School of Computer, Central China Normal University, WuHan, PR China;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China;National Language Resources Monitor Research Center for Network Media, Central China Normal University, WuHan, PR China;National Engineering Research Center for E-Learning, Central China Normal University, WuHan, PR China;[Chengcheng Fu; Tingting He] School of Computer, Central China Normal University, WuHan, PR China<&wdkj&>National Engineering Research Center for E-Learning, Central China Normal University, WuHan, PR China<&wdkj&>Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, WuHan, PR China<&wdkj&>National Language Resources Monitor Research Center for Network Media, Central China Normal University, WuHan, PR China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
关键词:
Knowledge graph;Multimodal embedding;Knowledge reasoning;Nutrition;Human health
摘要:
The established links between nutrition and human health are widely acknowledged. Dietary nutrients play a crucial role in regulating gut microbial communities, influencing various human diseases. With a growing number of related studies, there’s a need to systematically organize these associations for coherent knowledge reasoning. However, due to the diverse and extensive nature of the knowledge landscape, significant challenges persist. To address this, we propose an approach using multimodal data and knowledge embeddings for effective knowledge reasoning in nutrition and human health. We create a comprehensive knowledge graph, KG4NH, covering dietary nutrition, gut microbiota, and human diseases. To ensure efficient knowledge representation, we employ knowledge embedding techniques to develop modality-specific encoders for structure, category, and description. Additionally, we introduce a mul-timodal fusion method to capture shared information across modalities. Our experimental results demonstrate the superiority of our approach over other state-of-the-art methods.
期刊:
BRIEFINGS IN BIOINFORMATICS,2023年24(2) ISSN:1467-5463
通讯作者:
Xingpeng Jiang
作者机构:
[Wang, Haodong; Wang, Yue; Xiao, Zhen; Huang, Xiaoyun; He, Tingting; Jiang, Xingpeng; Sun, Han] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, China;[Wang, Haodong; Wang, Yue; Xiao, Zhen; Huang, Xiaoyun; He, Tingting; Jiang, Xingpeng; Sun, Han] School of Computer Science, Central China Normal University, Wuhan 430079, China;[Xiao, Zhen; Sun, Han] School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China;[Huang, Xiaoyun] Collaborative & Innovative Center for Educational Technology, Central China Normal University, Wuhan 430079, China;[He, Tingting; Jiang, Xingpeng] National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan 430079, China
通讯机构:
[Xingpeng Jiang] H;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University , Wuhan 430079 , China<&wdkj&>School of Computer Science, Central China Normal University , Wuhan 430079 , China<&wdkj&>National Language Resources Monitoring & Research Center for Network Media, Central China Normal University , Wuhan 430079 , China
关键词:
Kernel machine regression;Microbiome-based association test;Multinomial logit model;Ordinal/Nominal multicategory phenotypes
摘要:
<jats:title>Abstract</jats:title><jats:p>Microbes can affect the metabolism and immunity of human body incessantly, and the dysbiosis of human microbiome drives not only the occurrence but also the progression of disease (i.e. multiple statuses of disease). Recently, microbiome-based association tests have been widely developed to detect the association between the microbiome and host phenotype. However, the existing methods have not achieved satisfactory performance in testing the association between the microbiome and ordinal/nominal multicategory phenotypes (e.g. disease severity and tumor subtype). In this paper, we propose an optimal microbiome-based association test for multicategory phenotypes, namely, multiMiAT. Specifically, under the multinomial logit model framework, we first introduce a microbiome regression-based kernel association test for multicategory phenotypes (multiMiRKAT). As a data-driven optimal test, multiMiAT then integrates multiMiRKAT, score test and MiRKAT-MC to maintain excellent performance in diverse association patterns. Massive simulation experiments prove the success of our method. Furthermore, multiMiAT is also applied to real microbiome data experiments to detect the association between the gut microbiome and clinical statuses of colorectal cancer as well as for diverse statuses of Clostridium difficile infections.</jats:p>
作者机构:
Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China;School of Computer, Central China Normal University, Wuhan, China;National Language Resources Monitoring and Research Center for Network Media, Central China Normal University, Wuhan, China;[Xiaowei Xu] Department of Information Science, University of Arkansas at Little Rock, Little Rock, AR, USA;[Xueling Yuan] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China<&wdkj&>School of Computer, Central China Normal University, Wuhan, China
会议名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
05 December 2023
会议地点:
Istanbul, Turkiye
会议论文集名称:
2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
关键词:
drug–drug interaction;cold-start scenario;counterfactual inference;meta-path-based fusion;heterogeneous information network
摘要:
Drug-drug interaction (DDI) pertains to the occurrence where the concomitant use of two or more drugs may lead to interactions in terms of their pharmacokinetic or pharmacodynamic behavior, resulting in unexpected effects. Accurately predicting DDIs holds significant importance in ensuring drug safety. Despite the numerous approaches proposed for DDI prediction, a majority of these methods often overlook the challenge presented by cold-start scenario, consequently limiting their applicability. This paper presents a novel data augmentation approach for the prediction of DDIs in cold-start scenarios. This method leverages counterfactual inference to generate meaningful pseudo samples for drugs with limited prior information. To achieve this, a HIN relevant to DDIs is initially established by amalgamating various associations between drugs and proteins. Subsequently, the identification of drug communities within this HIN is regarded as a form of counterfactual inference treatment, facilitating the generation of counterfactual links for cold-start drugs and thereby augmenting the training dataset. Lastly, we enhance our understanding of drug characteristics through a meta-path-based fusion mechanism, ultimately improving the accuracy of DDIs prediction in cold-start scenarios. We substantiate the effectiveness of our proposed method through an extensive series of experiments.
期刊:
Information Processing & Management,2023年60(1):103114 ISSN:0306-4573
通讯作者:
Weizhong Zhao
作者机构:
[Zhao, Weizhong; Xia, Jun; He, Tingting; Jiang, Xingpeng] Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Hubei, Peoples R China.;[Zhao, Weizhong] Cent China Normal Univ, Sch Comp, Wuhan 430079, Hubei, Peoples R China.;[Zhao, Weizhong] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Netwo, Wuhan 430079, Hubei, Peoples R China.
通讯机构:
[Weizhong Zhao] H;Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, China<&wdkj&>School of Computer, Central China Normal University, Wuhan 430079, Hubei, China<&wdkj&>National Language Resources Monitoring and Research Center for Network Media, Central China Normal University, Wuhan 430079, Hubei, China
关键词:
Deep knowledge tracing;Forgetting and learning mechanisms;Intelligent education
期刊:
IEEE/ACM Transactions on Computational Biology and Bioinformatics,2023年20(6):3635-3647 ISSN:1545-5963
作者机构:
[Shi, Chuan] School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China;[Yao, Wenjie; Jiang, Xingpeng; He, Tingting] School of Computer, Central China Normal University, Wuhan, Hubei, China;[Hu, Xiaohua] College of Computing &Informatics, Drexel University, Philadelphia, PA, USA;[Zhao, Weizhong] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, School of Computer, and National Language Resources Monitoring and Research Center for Network Media, Central China Normal University, Wuhan, Hubei, China
摘要:
Side effects of drugs have gained increasing attention in the biomedical field, and accurate identification of drug side effects is essential for drug development and drug safety surveillance. Although the traditional pharmacological experiments can accurately detect the side effects of drugs, the identifying process is time-consuming, costly, and may lead to incomplete identification of side effects. With the expanding of various biomedical databases, many computational methods have been developed for the task of drug-side effect associations (DSAs) prediction. However, existing methods have the following three drawbacks: 1). multiple drug-related databases are not fully used; 2). the complex semantics among drugs and side effects are not effectively captured; 3). the explainability of the predicted DSAs is missed for most existing methods. Therefore, there is an urgent need to find a more effective method for predicting DSAs. To address these issues, we propose a novel meta-path-based graph neural network model for drug-side effect associations prediction (MPGNN-DSA). In MPGNN-DSA, a heterogeneous information network is first constructed by combining multiple biological datasets. Then, a meta-path-based feature learning module is utilized for learning high-quality representations of drugs and side effects by capturing the semantics contained in meta-paths of the constructed HIN. With the learned features, the prediction module is conducted to derive the predicted side effects for drugs. In addition, the explainability of the predicted DSAs can be provided as well with the semantics contained in meta-paths. We conduct comprehensive experiments, and the results demonstrate the effectiveness of MPGNN-DSA, suggesting that the proposed method will be a feasible solution to the task of DSAs prediction. Side effects of drugs have gained increasing attention in the biomedical field, and accurate identification of drug side effects is essential for drug development and drug safety surveillance. Although the traditional pharmacological experiments can accurately detect the side effects of drugs, the identifying process is time-consuming, costly, and may lead to incomplete identification of side effects. With the expanding of various biomedical databases, many computational methods have been developed for the task of drug-side effect associations (DSAs) prediction. However, existing methods have the following three drawbacks: 1). multiple drug-related databases are not fully used; 2). the complex semantics among drugs and side effects are not effectively captured; 3). the explainability of the predicted DSAs is missed for most existing methods. Therefore, there is an urgent need to find a more effective method for predicting DSAs. To address these issues, we propose a novel meta-path-based graph neural network model for drug-side effect associations prediction (MPGNN-DSA). In MPGNN-DSA, a heterogeneous information network is first constructed by combining multiple biological datasets. Then, a meta-path-based feature learning module is utilized for learning high-quality representations of drugs and side effects by capturing the semantics contained in meta-paths of the constructed HIN. With the learned features, the prediction module is conducted to derive the predicted side effects for drugs. In addition, the explainability of the predicted DSAs can be provided as well with the semantics contained in meta-paths. We conduct comprehensive experiments, and the results demonstrate the effectiveness of MPGNN-DSA, suggesting that the proposed method will be a feasible solution to the task of DSAs prediction.
摘要:
Drug repurposing, which typically applies the procedure of drug-disease associations (DDAs) prediction, is a feasible solution to drug discovery. Compared with traditional methods, drug repurposing can reduce the cost and time for drug development and advance the success rate of drug discovery. Although many methods for drug repurposing have been proposed and the obtained results are relatively acceptable, there is still some room for improving the predictive performance, since those methods fail to consider fully the issue of sparseness in known drug-disease associations. In this paper, we propose a novel multi-task learning framework based on graph representation learning to identify DDAs for drug repurposing. In our proposed framework, a heterogeneous information network is first constructed by combining multiple biological datasets. Then, a module consisting of multiple layers of graph convolutional networks is utilized to learn low-dimensional representations of nodes in the constructed heterogeneous information network. Finally, two types of auxiliary tasks are designed to help to train the target task of DDAs prediction in the multi-task learning framework. Comprehensive experiments are conducted on real data and the results demonstrate the effectiveness of the proposed method for drug repurposing.
摘要:
The increasing prevalence of antibiotic resistance has become a global health crisis. For the purpose of safety regulation, it is of high importance to identify antibiotic resistance genes (ARGs) in bacteria. Although culture-based methods can identify ARGs relatively more accurately, the identifying process is time-consuming and specialized knowledge is required. With the rapid development of whole genome sequencing technology, researchers attempt to identify ARGs by computing sequence similarity from public databases. However, these computational methods might fail to detect ARGs due to the low sequence identity to known ARGs. Moreover, existing methods cannot effectively address the issue of multidrug resistance prediction for ARGs, which is a great challenge to clinical treatments. To address the challenges, we propose an end-to-end multi-label learning framework for predicting ARGs. More specifically, the task of ARGs prediction is modeled as a problem of multi-label learning, and a deep neural network-based end-to-end framework is proposed, in which a specific loss function is introduced to employ the advantage of multi-label learning for ARGs prediction. In addition, a dual-view modeling mechanism is employed to make full use of the semantic associations among two views of ARGs, i.e. sequence-based information and structure-based information. Extensive experiments are conducted on publicly available data, and experimental results demonstrate the effectiveness of the proposed framework on the task of ARGs prediction.
作者:
Shengwei Ye;Weizhong Zhao;Xianjun Shen;Xingpeng Jiang;Tingting He
作者机构:
Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, Hubei, PR China;School of Computer, Central China Normal University, Wuhan, Hubei, PR China;National Language Resources Monitoring & Research Center for Network Media Central China Normal University, Wuhan, Hubei, PR China;[Shengwei Ye; Weizhong Zhao; Xianjun Shen; Xingpeng Jiang; Tingting He] Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, Hubei, PR China<&wdkj&>School of Computer, Central China Normal University, Wuhan, Hubei, PR China<&wdkj&>National Language Resources Monitoring & Research Center for Network Media Central China Normal University, Wuhan, Hubei, PR China
会议名称:
2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
会议时间:
06 December 2022
会议地点:
Las Vegas, NV, USA
会议论文集名称:
2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
关键词:
drug repositioning;drug-disease associations prediction;multi-task learning;graph convolutional network;heterogeneous information network
摘要:
Compared with traditional methods, drug repositioning is a viable solution to drug discovery. Drug repositioning usually applies the procedure of drug-disease associations (DDAs) prediction, which can reduce the cost and time of drug development and improve the success rate of drug discovery. In this paper, we develop a new multi-task learning framework based on heterogeneous graph convolutional network (MTHGCN) to recognize potential DDAs. In MTHGCN, a heterogeneous information network is constructed by combining multiple biological datasets. And then, a module based on graph convolutional networks is utilized to learn low-dimensional representations of drugs and diseases. Finally, we design two types of auxiliary tasks to help to train the target DDAs prediction task based on the multi-task learning mechanism. We conduct comprehensive experiments on MTHGCN. The results demonstrate the effectiveness of MTHGCN for drug repositioning.