National Engineering Research Center for E-Learning

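The National Engineering Research Center for E-Learning (NERCEL), hosted by Central China Normal University, is a dedicated R&D institution in China for research on educational informatization technology and the transfer of research results into practice. In 2004, via the Hubei Provincial Development and ...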

Publications: 1419
Highly cited: 2
SCI-E: 263
SSCI: 132
A&HCI: 1
CPCI-S: 243
EI: 190
Medline: 12
CSCD: 84
CSSCI: 216




263 records in total.

LDCNet: Limb Direction Cues-aware Network for Flexible Human Pose Estimation in Industrial Behavioral Biometrics Systems

Authors: Liu, Tingting; Liu, Hai; Yang, Bing; Zhang, Zhaoli

Journal: IEEE Transactions on Industrial Informatics, 2023: 1-11. ISSN: 1551-3203

Corresponding authors: Yang, B; Liu, H

Author affiliations: [Yang, Bing; Liu, Tingting] Hubei Univ, Sch Educ, 368 Youyi Rd, Wuhan 430062, Hubei, Peoples R China; [Yang, Bing; Liu, Tingting] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China; [Zhang, Zhaoli; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Corresponding affiliations: [Yang, B] Hubei Univ, Sch Educ, 368 Youyi Rd, Wuhan 430062, Hubei, Peoples R China; [Liu, H] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Keywords: Biometric authentication; multi-person pose estimation; differentiated Cauchy distribution; industrial behavioral biometrics; deep learning

Abstract: 2D human pose estimation (HPE) has been widely used in many fields such as behavioral understanding, identity authentication, and industrial automatic manufacturing. Most previous studies have encountered many constraints, such as restricted scenarios and strict inputs. To solve this problem, we present a simple yet effective HPE network called limb direction cues-aware network (LDCNet) with limb direction cues and differentiated Cauchy labels, which can efficiently suppress uncertainties and prevent deep networks from over-fitting uncertain keypoint positions. In particular, LDCNet suppresses the uncertainties from two aspects. (1) A differentiated Cauchy coordinate encoding method is designed to reveal the limb direction information among adjacent keypoints. (2) Jeffreys divergence is introduced as the loss function to measure the difference between the predicted heatmap and the ground-truth one. Positions of keypoints are perceived by the limb-direction-based deep network in an end-to-end manner. An extensive study on two benchmark data sets (i.e., MS COCO and MPII) illustrates the superiority of the proposed LDCNet model over state-of-the-art approaches.

Language: English
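
Note: the abstract above names Jeffreys divergence as the heatmap loss. The sketch below shows only the textbook definition, the symmetric sum KL(p || q) + KL(q || p), applied to two small normalized vectors standing in for heatmaps. It is not taken from the LDCNet paper: the function names, the `eps` smoothing term, and the example values are all assumptions made for illustration.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as equal-length lists.

    eps is an assumed smoothing constant that avoids log(0).
    """
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def jeffreys_divergence(p, q, eps=1e-12):
    """Jeffreys divergence: the symmetric sum KL(p || q) + KL(q || p)."""
    return kl_divergence(p, q, eps) + kl_divergence(q, p, eps)


def normalize(values):
    """Scale non-negative values so that they sum to 1."""
    total = sum(values)
    return [v / total for v in values]


# Hypothetical flattened "heatmaps" (illustrative values only).
predicted = normalize([0.1, 0.6, 0.2, 0.1])
target = normalize([0.05, 0.7, 0.2, 0.05])
loss = jeffreys_divergence(predicted, target)
```

Unlike plain KL divergence, this quantity does not change when the two arguments are swapped, so neither the prediction nor the ground truth is treated as the reference distribution.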


A review of visual sustained attention: neural mechanisms and computational models

Authors: Huang, Huimin; Li, Rui; Zhang, Junsong

Journal: PeerJ, 2023, 11: e15351. ISSN: 2167-8359

Corresponding author: Zhang, JS

Author affiliations: [Huang, Huimin; Li, Rui] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan, Hubei, Peoples R China; [Zhang, Junsong] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Brain Cognit & Intelligent Comp Lab, Xiamen, Fujian, Peoples R China.

Corresponding affiliation: [Zhang, JS] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Brain Cognit & Intelligent Comp Lab, Xiamen, Fujian, Peoples R China.

Keywords: Computational models; Evaluation; Neural mechanisms; Neural pathways; Sustained attention

Abstract: Sustained attention is one of the basic abilities of humans to maintain concentration on relevant information while ignoring irrelevant information over extended periods. The purpose of the review is to provide insight into how to integrate neural mechanisms of sustained attention with computational models to facilitate research and application. Although many studies have assessed attention, the evaluation of humans' sustained attention is not sufficiently comprehensive. Hence, this study provides a current review on both neural mechanisms and computational models of visual sustained attention. We first review models, measurements, and neural mechanisms of sustained attention and propose plausible neural pathways for visual sustained attention. Next, we analyze and compare the different computational models of sustained attention that the previous reviews have not systematically summarized. We then provide computational models for automatically detecting vigilance states and evaluation of sustained attention. Finally, we outline possible future trends in the research field of sustained attention.

Language: English


Deep brain stimulation of fornix in Alzheimer's disease: From basic research to clinical practice

Authors: Liu, Zhikun; Shu, Kai; Geng, Yumei; Cai, Chang; Kang, Huicong

Journal: European Journal of Clinical Investigation, 2023, 53(8): e13995. ISSN: 0014-2972

Author affiliations: [Liu, Zhikun; Shu, Kai] Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Neurosurg, Wuhan, Hubei, Peoples R China; [Kang, Huicong; Geng, Yumei] Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Neurol, Wuhan, Hubei, Peoples R China; [Cai, Chang] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China; [Kang, Huicong] 1095 Jiefang Blvd, Wuhan, Hubei, Peoples R China.

Keywords: AD; DBS; fornix; functional neurosurgery; neuromodulation; technique consideration

Abstract: Alzheimer's disease (AD) is one of the most common progressive neurodegenerative diseases associated with the degradation of memory and cognitive ability. Current pharmacotherapies show little therapeutic effect in AD treatment and still cannot prevent the pathological progression of AD. Deep brain stimulation (DBS) has been shown to enhance memory in morbidly obese, epilepsy and traumatic brain injury patients, and cognition in Parkinson's disease (PD) patients deteriorates during DBS off. Some relevant animal studies and clinical trials have been carried out to discuss the DBS treatment for AD. Reviewing the fornix trials, no unified conclusion has been reached about the clinical benefits of DBS in AD, and the dementia rating scale has not been effectively improved in the long term. However, some patients have presented promising results, such as improved glucose metabolism, increased connectivity in cognition-related brain regions and even elevated cognitive function rating scale scores. The fornix plays an important regulatory role in memory, attention, and emotion through its complex fibre projection to cognition-related structures, making it a promising target for DBS for AD treatment. Moreover, the current stereotaxic technique and various evaluation methods have provided references for the operator to select accurate stimulation points. Related adverse events and relatively higher costs in DBS have been emphasized. In this article, we summarize and update the research progression on fornix DBS in AD and seek to provide a reliable reference for subsequent experimental studies on DBS treatment of AD. © 2023 Stichting European Society for Clinical Investigation Journal Foundation. Published by John Wiley & Sons Ltd.

Language: English


Metabolite-disease interaction prediction based on logistic matrix factorization and local neighborhood constraints

Authors: Zhao, Yongbiao; Ma, Yuanyuan; Zhang, Qilin

Journal: Frontiers in Psychiatry, 2023, 14: 1149947. ISSN: 1664-0640

Corresponding author: Ma, YY

Author affiliations: [Zhao, Yongbiao] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan, Hubei, Peoples R China; [Zhao, Yongbiao; Ma, Yuanyuan; Ma, YY; Zhang, Qilin] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang, Hubei, Peoples R China.

Corresponding affiliation: [Ma, YY] Hubei Univ Arts & Sci, Sch Comp Engn, Xiangyang, Hubei, Peoples R China.

Keywords: association prediction; logistic matrix factorization; metabolite-disease interaction; neighborhood regularization; vicus matrix

Abstract: Background: Increasing evidence indicates that metabolites are closely related to human diseases. Identifying disease-related metabolites is especially important for the diagnosis and treatment of disease. Previous works have mainly focused on the global topological information of metabolite and disease similarity networks. However, the local tiny structure of metabolites and diseases may have been ignored, leading to insufficiency and inaccuracy in the latent metabolite-disease interaction mining. Methods: To solve the aforementioned problem, we propose a novel metabolite-disease interaction prediction method with logistic matrix factorization and local nearest neighbor constraints (LMFLNC). First, the algorithm constructs metabolite-metabolite and disease-disease similarity networks by integrating multi-source heterogeneous microbiome data. Then, the local spectral matrices based on these two networks are established and used as the input of the model, together with the known metabolite-disease interaction network. Finally, the probability of metabolite-disease interaction is calculated according to the learned latent representations of metabolites and diseases. Results: Extensive experiments on the metabolite-disease interaction data were conducted. The results show that the proposed LMFLNC method outperformed the second-best algorithm by 5.28% and 5.61% in the AUPR and F1, respectively. The LMFLNC method also exhibited several potential metabolite-disease interactions, such as “Cortisol” (HMDB0000063), relating to “21-Hydroxylase deficiency,” and “3-Hydroxybutyric acid” (HMDB0000011) and “Acetoacetic acid” (HMDB0000060), both relating to “3-Hydroxy-3-methylglutaryl-CoA lyase deficiency.” Conclusion: The proposed LMFLNC method can well preserve the geometrical structure of original data and can thus effectively predict the underlying associations between metabolites and diseases. The experimental results show its effectiveness in metabolite-disease interaction prediction. Copyright © 2023 Zhao, Ma and Zhang.

Language: English


KBHN: A knowledge-aware bi-hypergraph network based on visual-knowledge features fusion for teaching image annotation

Authors: Li, Hao; Wang, Jing; Du, Xu; Hu, Zhuang; Yang, Shuoqiu

Journal: Information Processing & Management, 2023, 60(1): 103106. ISSN: 0306-4573

Corresponding author: Jing Wang

Author affiliations: [Yang, Shuoqiu; Li, Hao; Hu, Zhuang; Du, Xu; Wang, Jing] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China; [Yang, Shuoqiu; Li, Hao; Hu, Zhuang; Du, Xu; Wang, Jing] Cent China Normal Univ, Fac Artificial Intelligence Educ, Wuhan 430079, Peoples R China.

Corresponding affiliations: [Jing Wang] National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079, China; Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, China

Keywords: Bi-hypergraph network; Intelligent education; Knowledge hypergraph; Teaching image annotation; Visual-knowledge features fusion; Visual-knowledge inconsistency

Abstract: Teaching images, as an important auxiliary tool in teaching and learning, are fundamentally different from the general domain images. Besides visually similar images being more likely to share common labels, teaching images also face the challenge of visual-knowledge inconsistency, including intra-knowledge visual difference and inter-knowledge visual similarity. To address the above challenges, we present KBHN, a knowledge-aware bi-hypergraph network, which not only considers coarse-grained visual features, but also extracts fine-grained knowledge features that reflect knowledge intention hidden in teaching images. In detail, a visual hypergraph is constructed to connect images with visual similarity. It further enriches coarse-grained visual features by modeling the high-order visual relations among teaching images. Moreover, a knowledge hypergraph based on typical images is built to aggregate images with similar knowledge information, which innovatively extracts fine-grained knowledge features by modeling high-order knowledge correlations between local regions. Furthermore, a multi-head attention mechanism is adopted to fuse visual-knowledge features for enriching image representation. A teaching image dataset is constructed to train and validate our model, which contains 20744 real-world images annotated with 24 knowledge points. Experimental results demonstrate that KBHN, incorporating visual-knowledge features, achieves state-of-the-art performance compared to existing methods. © 2022 Elsevier Ltd

Language: English


Automated Video Generation of Moving Digits from Text Using Deep Deconvolutional Generative Adversarial Network

Authors: Ullah, Anwar; Yu, Xinguo; Numan, Muhammad

Journal: Computers, Materials & Continua, 2023, 77(2): 2359-2383. ISSN: 1546-2218

Corresponding author: Yu, XG

Author affiliations: [Ullah, Anwar; Yu, Xinguo; Yu, XG] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Numan, Muhammad] Cent China Normal Univ, Wollongong Joint Inst, Wuhan 430079, Peoples R China.

Corresponding affiliation: [Yu, XG] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Keywords: Generative Adversarial Network (GAN); deconvolutional neural network; convolutional neural network; Inception Score (IS); temporal coherence; Frechet Inception Distance (FID); Generative Adversarial Metric (GAM)

Abstract: Generating realistic and synthetic video from text is a highly challenging task due to the multitude of issues involved, including digit deformation, noise interference between frames, blurred output, and the need for temporal coherence across frames. In this paper, we propose a novel approach for generating coherent videos of moving digits from textual input using a Deep Deconvolutional Generative Adversarial Network (DD-GAN). The DD-GAN comprises a Deep Deconvolutional Neural Network (DDNN) as a Generator (G) and a modified Deep Convolutional Neural Network (DCNN) as a Discriminator (D) to ensure temporal coherence between adjacent frames. The proposed research involves several steps. First, the input text is fed into a Long Short Term Memory (LSTM) based text encoder and then smoothed using Conditioning Augmentation (CA) techniques to enhance the effectiveness of the Generator (G). Next, a DDNN generates video frames from the enhanced text and random noise, and a modified DCNN acts as a Discriminator (D), effectively distinguishing between generated and real videos. This research evaluates the quality of the generated videos using standard metrics like Inception Score (IS), Frechet Inception Distance (FID), Frechet Inception Distance for video (FID2vid), and Generative Adversarial Metric (GAM), along with a human study based on realism, coherence, and relevance. By conducting experiments on Single-Digit Bouncing MNIST GIFs (SBMG), Two-Digit Bouncing MNIST GIFs (TBMG), and a custom dataset of essential mathematics videos with related text, this research demonstrates significant improvements in both metrics and human study results, confirming the effectiveness of DD-GAN. This research also took on the challenge of generating preschool math videos from text, handling complex structures, digits, and symbols, and achieving successful results. The proposed research demonstrates promising results for generating coherent videos from textual input.

Language: English


RIECN: learning relation-based interactive embedding convolutional network for knowledge graph

Authors: Wang, Wei; Shen, Xiaoxuan; Zhang, Huanyu; Li, Zhifei; Yi, Baolin

Journal: Neural Computing and Applications, 2023, 35(11): 8343-8356. ISSN: 0941-0643

Corresponding author: Baolin Yi

Author affiliations: [Shen, Xiaoxuan; Wang, Wei; Zhang, Huanyu; Li, Zhifei; Yi, Baolin] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China.

Corresponding affiliation: [Baolin Yi] National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, China

Keywords: Link prediction; Knowledge graph embedding; Convolution neural network; Feature interaction; Complex relations

Abstract: Most knowledge graphs (KGs) are large and incomplete graph-structured databases, which can be completed by predicting missing links according to the existing knowledge. The mainstream method is knowledge graph embedding (KGE), which is designed to learn low-dimensional embeddings of entities and relations. However, knowledge graph embedding still faces two major issues: (1) How to generate more expressive embeddings? (2) How to solve semantic polysemy of entities in different relations? In this paper, we propose a novel KG embedding model, RIECN (Relation-based Interactive Embedding Convolutional Network), which achieves high-quality performance and shows some advancements in modeling complex relations. In RIECN, the FIR (Feature Interaction Reshaping) method is introduced to increase the feature interactions between entity and relation embeddings to generate more expressive feature maps. In addition, a new method of generating relation-based dynamic convolution filters, RDCF, is proposed. RDCF generates relation-specific and hybrid-size convolution filters, which enrich the feature maps of each entity, improving the accuracy of the link prediction task, especially in complex-relation scenarios. We tested the performance of our model on five benchmark datasets. The experimental results show that the RIECN model significantly outperforms recent state-of-the-art models by 0.1–3.2% and 1.1–3.7%, in terms of the MRR metric and the Hit@1 metric, respectively.

Language: English
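
Note: link-prediction results such as those in the abstract above are commonly reported as mean reciprocal rank (MRR) and Hits@k. The sketch below shows the standard definitions only, assuming each test triple has already been assigned the 1-based rank of its correct entity among all candidates. The function names and example ranks are hypothetical and are not taken from the RIECN paper.

```python
def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all test triples (ranks are 1-based)."""
    return sum(1.0 / r for r in ranks) / len(ranks)


def hits_at_k(ranks, k):
    """Fraction of test triples whose correct entity is ranked in the top k."""
    return sum(1 for r in ranks if r <= k) / len(ranks)


# Hypothetical ranks of the correct entity for five test triples.
example_ranks = [1, 3, 2, 1, 10]
mrr = mean_reciprocal_rank(example_ranks)
hits1 = hits_at_k(example_ranks, 1)
```

Both metrics lie in (0, 1] and reward placing the correct entity near the top; Hits@1 counts only exact top-ranked predictions, while MRR also gives partial credit for near misses.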


GCANet: Geometry cues-aware facial expression recognition based on graph convolutional networks

Authors: Wang, Shutong; Zhao, Anran; Lai, Chenghang; Zhang, Qi; Li, Duantengchuan; ...

Journal: Journal of King Saud University - Computer and Information Sciences, 2023, 35(7): 101605. ISSN: 1319-1578

Corresponding author: Zhang, Q

Author affiliations: [Wang, Xiaoguang; Wang, Shutong] Wuhan Univ, Sch Informat Management, Wuhan 430072, Peoples R China; [Wang, Shutong] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China; [Zhao, Anran] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China; [Lai, Chenghang] Fudan Univ, Sch Comp Sci, Shanghai 200438, Peoples R China; [Zhang, Qi; Zhang, Q] Cent China Normal Univ, Sch Informat Management, Wuhan 430079, Peoples R China.

Corresponding affiliation: [Zhang, Q] Cent China Normal Univ, Sch Informat Management, Wuhan 430079, Peoples R China.

Keywords: Facial expression recognition; Graph convolutional network; Geometry cue; Uncertainty; Emotion label distribution learning

Abstract: Facial expression recognition (FER) task in the wild is challenging due to some uncertainties, such as the ambiguity of facial expressions, subjective annotations, and low-quality facial images. A novel model for FER in-the-wild datasets is proposed in this study to solve these uncertainties. The overview of the proposed method is as follows. First, the facial images are grouped into high and low uncertainties by the pre-trained network. The graph convolutional network (GCN) framework is then used for the facial images with low uncertainty to obtain geometry cues, including the relationship among action units (AUs) and the implicit connection between AUs and expressions, which help predict the probability of the underlying emotional label. The emotion label distribution is produced by combining the predicted latent label probability and the given label. For the facial images with high uncertainty, k-nearest neighbor graphs are built to determine the k facial images in the low uncertainty group with the highest similarity to the given facial image. The emotion label distribution of the given image is then replaced by fusing the emotion label distribution based on the distances between the given image and its adjacent images. Finally, the constructed emotion label distribution facilitates training in a straightforward manner using a convolutional neural network framework to identify facial expressions. Experimental results on RAF-DB, FERPlus, AffectNet, and SFEW2.0 datasets demonstrate that the proposed method achieved superior performance compared to state-of-the-art approaches. © 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Language: English


A Stack-Propagation Framework With Slot Filling for Multi-Domain Dialogue State Tracking

Authors: Wang, Yufan; He, Tingting; Mei, Jie; Fan, Rui; Tu, Xinhui

Journal: IEEE Transactions on Neural Networks and Learning Systems, 2022, 35(1): 1240-1254. ISSN: 2162-237X

Author affiliations: [Fan, Rui; Wang, Yufan] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Hubei Prov Key Lab Artificial Intelligence & Smar, Wuhan 430079, Peoples R China; [Fan, Rui; Tu, Xinhui; Wang, Yufan; Mei, Jie; He, Tingting] Cent China Normal Univ, Natl Language Resources Monitor & Res Ctr Network, Wuhan 430079, Peoples R China; [Tu, Xinhui; Mei, Jie; He, Tingting] Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smar, Natl Language Resources Monitor & Res Ctr Network, Wuhan 430079, Peoples R China; [Tu, Xinhui; Mei, Jie; He, Tingting] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China.

Keywords: Computational modeling; Context modeling; Deep learning; Detectors; dialogue state tracking (DST); dialogue system; Filling; History; Semantics; slot filling; Task analysis

Abstract: Dialogue state tracking (DST) is a core component of task-oriented dialogue systems. Recent works focus mainly on end-to-end DST models that omit the spoken language understanding (SLU) module to directly obtain the dialogue state based on a user’s dialogue. However, the slot information detected by slot filling in SLU is closely tied to the slot–value pair that needs to be updated in DST. Efficient use of the key slot semantic knowledge obtained by slot filling contributes to improving the performance of DST. Based on this idea, we introduce slot filling as a subtask and build an end-to-end joint model to explicitly integrate the slot information detected by slot filling, which further guides DST. In this article, a novel stack-propagation framework with slot filling for multidomain DST is proposed. The stack-propagation framework is introduced to jointly model slot filling and DST. The framework directly feeds the key slot semantic knowledge detected by slot filling into the DST module. In addition, a slot-masked attention mechanism is designed to enable DST to focus on the key slot information obtained by slot filling. When the slot value is updated, a slot–value softcopy mechanism is designed to enhance the influence of the words marked by key slots. Experiments show that our approach outperforms previous methods and performs outstandingly on two benchmark datasets. IEEE

Language: English


EDMF: Efficient Deep Matrix Factorization With Review Feature Learning for Industrial Recommender System

Authors: Liu, Hai; Zheng, Chao; Li, Duantengchuan; Shen, Xiaoxuan; Lin, Ke; ...

Journal: IEEE Transactions on Industrial Informatics, 2022, 18(7): 4361-4371. ISSN: 1551-3203

Corresponding author: Li, D.

Author affiliations: [Zhang, Zhaoli; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Li, Duantengchuan] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China; [Zheng, Chao; Shen, Xiaoxuan; Xiong, Neal N.] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China; [Lin, Ke] Harbin Inst Technol, Dept Control Sci & Engn, Shenzhen 518055, Peoples R China; [Wang, Jiazhang] Northwestern Univ, Evanston, IL 60208 USA.

Corresponding affiliation: [Li, D.] School of Computer Science, China

Keywords: Deep matrix factorization; industrial recommender system; interactivity; L0 norm; sparsity property

Abstract: Recommendation accuracy is a fundamental problem in the quality of the recommendation system. In this article, we propose an efficient deep matrix factorization (EDMF) with review feature learning for the industrial recommender system. Two characteristics in the user's review are revealed. First, interactivity between the user and the item, which can also be considered as the former's scoring behavior on the latter, is exploited in a review. Second, the review is only a partial description of the user's preferences for the item, which is revealed as the sparsity property. Specifically, in the first characteristic, EDMF extracts the interactive features of onefold review by convolutional neural networks with word-attention mechanism. Subsequently, the $L_0$ norm is leveraged to constrain the review considering that the review information is a sparse feature, which is the second characteristic. Furthermore, the loss function is constructed by maximum a posteriori estimation theory, where the interactivity and sparsity property are converted as two prior probability functions. Finally, the alternative minimization algorithm is introduced to optimize the loss functions. Experimental results on several datasets demonstrate that the proposed methods, which show good industrial conversion application prospects, outperform the state-of-the-art methods in terms of effectiveness and efficiency. © 2005-2012 IEEE.

Language: English


MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation

Authors: Liu, Hai; Fang, Shuai; Zhang, Zhaoli; Li, Duantengchuan; Lin, Ke; ...

Journal: IEEE Transactions on Multimedia, 2022, 24: 2449-2460. ISSN: 1520-9210

Corresponding author: Fang, S.

Author affiliations: [Li, Duantengchuan; Zhang, Zhaoli; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Fang, Shuai] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China; [Lin, Ke] Harbin Inst Technol, Control Sci & Engn, Shenzhen 150001, Peoples R China; [Wang, Jiazhang] Northwestern Univ, Evanston, IL 60208 USA.

Corresponding affiliation: [Fang, S.] Central China Normal University, National Engineering Laboratory For Educational Big Data, Wuhan, China

Keywords: Head; Pose estimation; Three-dimensional displays; Measurement; Feature extraction; Uncertainty; Solid modeling; Head pose estimation; Triplet loss; Rotation matrix; Matrix Fisher distribution; Metric learning

Abstract: Head pose estimation suffers from several problems, including low pose tolerance under different disturbances and ambiguity arising from common head pose representation. In this study, a robust three-branch model with triplet module and matrix Fisher distribution module is proposed to address these problems. Based on metric learning, the triplet module employs triplet architecture and triplet loss. It is implemented to maximize the distance between embeddings with different pose pairs and minimize the distance between embeddings with same pose pairs. It can learn a highly discriminative and robust embedding related to head pose. Moreover, the rotation matrix instead of Euler angle and unit quaternion is utilized to represent head pose. An exponential probability density model based on the rotation matrix (referred to as the matrix Fisher distribution) is developed to model head rotation uncertainty. The matrix Fisher distribution can further analyze the head pose, and its maximum likelihood obtained using singular value decomposition provides enhanced accuracy. Extensive experiments executed over AFLW2000 and BIWI datasets demonstrate that the proposed model achieves state-of-the-art performance in comparison with traditional methods. © 1999-2012 IEEE.

Language: English
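
Note: the triplet loss mentioned in the abstract above pulls an anchor embedding toward a same-pose (positive) embedding and pushes it away from a different-pose (negative) one. The sketch below shows the standard hinge form, max(0, d(a, p) - d(a, n) + margin), with Euclidean distance. The margin value, the distance choice, and the toy embeddings are assumptions for illustration and are not taken from the MFDNet paper.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard hinge-style triplet loss.

    The loss is zero once the negative is at least `margin` farther from
    the anchor than the positive is. The margin of 0.2 is an assumed value.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Toy 2-D embeddings (illustrative values only).
anchor = [0.0, 0.0]
same_pose = [0.0, 1.0]       # distance 1 from the anchor
different_pose = [3.0, 4.0]  # distance 5 from the anchor

well_separated = triplet_loss(anchor, same_pose, different_pose)
violating = triplet_loss(anchor, different_pose, same_pose)
```

In this toy case the first triplet already satisfies the margin and contributes no loss, while the second, with the roles of positive and negative swapped, is penalized.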


GMDL: Toward precise head pose estimation via Gaussian mixed distribution learning for students’ attention understanding

Authors: Liu, Tingting; Yang, Bing; Liu, Hai; Ju, Jianping; Tang, Jianyin; ...

Journal: Infrared Physics & Technology, 2022, 122: 104099. ISSN: 1350-4495

Corresponding authors: Liu, H; Ju, JP

Author affiliations: [Yang, Bing; Liu, Tingting] Hubei Univ, Sch Educ, 368 Youyi Rd, Wuhan 430062, Hubei, Peoples R China; [Subramanian, Sriram; Liu, Tingting; Zhang, Zhaoli; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Ju, Jianping] Hubei Business Coll, Sch Artificial Intelligence, Wuhan 430079, Peoples R China; [Tang, Jianyin] Changchun Univ Sci & Technol, Sch Electromech Engn, Changchun 130022, Peoples R China; [Liu, Hai] UCL, UCL Interact Ctr, London, England.

Corresponding affiliations: [Ju, JP] Hubei Business Coll, Sch Artificial Intelligence, Wuhan 430070, Peoples R China; [Liu, H] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Keywords: Attention understanding; Gaussian mixed distribution; Infrared head pose estimation; Infrared imaging; Label learning; Learning behavior analysis; Regularization

Abstract: Students’ head pose estimation is a very difficult task since the training data is insufficient for many head pose angles. In this study, we consider each head pose image as a Gaussian mixed distribution rather than the traditional hard label, in which the adjacent head pose images can provide supplementary information for the target image. Specifically, the Gaussian mixed distribution covers the current head pose image and its 8 adjacent head pose images. Each label of a head pose image describes the degree of similarity between the current image and its adjacent head pose images. Then, a novel network architecture is proposed by constructing the Gaussian mixed distribution, which learns more discriminative facial features. The extensive evaluations on two public HPE databases show that the proposed GMDL model obtains better performance compared with the conventional algorithms. In practice, the proposed model can be utilized to estimate learners’ head pose angle for attention understanding in instruction and learning scenarios. © 2022 Elsevier B.V.

Language: English


Dual-position features fusion for head pose estimation for complex scene

Authors: Zhu, Xiaoliang; Yang, Qiaolai; Zhao, Liang; Dai, Zhicheng; He, Zili; ...

Journal: Optik, 2022, 270: 169986. ISSN: 0030-4026

Corresponding authors: Liang Zhao; Zhicheng Dai

Author affiliations: [Rong, Wenting; He, Zili; Zhao, Liang; Yang, Qiaolai; Zhu, Xiaoliang] Cent China Normal Univ, Natl Engn Res Ctr Educ Big Data, Wuhan 430079, Peoples R China; [Rong, Wenting; He, Zili; Dai, Zhicheng] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Corresponding affiliations: [Liang Zhao; Zhicheng Dai] National Engineering Research Center for E-Learning, Central China Normal University, WuHan 430079, PR China; National Engineering Research Center for Educational Big Data, Central China Normal University, WuHan 430079, PR China

Keywords: Head pose estimation; Standard luminance; Center offset loss; Border adjustment; Feature fusion

Abstract: Head pose estimation (HPE) is widely used in attention detection, behavior analysis, and expression recognition. Nevertheless, in some complex scenes (such as facial occlusion, large head deflection angle, and multi-person in one scene), HPE still has the problem of low estimation accuracy. To solve this problem, we propose a dual position feature fusion method for estimating head pose. First, the RGB input is replaced with a standard luminance, which reduces the effect of extraneous light factors. Subsequently, the center offset loss is used to detect the head and body position, and a dynamic adjustment strategy is used to deflate the border, aiming to not only obtain the best confidence level but also improve the capability of multi-person HPE. Finally, the estimation results under head position and body position are fused to further reduce the estimation loss. We tested our approach on the popular public AFLW2000, BIWI, and UPNA datasets; the results show the superiority of our approach in solving the occlusion, deflection, and multi-person scene problems.

Language: English

Electromagnetic Source Imaging via a Data-Synthesis-Based Convolutional Encoder-Decoder Network

Authors: Huang, Gexin; Liu, Ke; Liang, Jiawen; Cai, Chang; Gu, Zheng Hui; ...

Journal: IEEE Transactions on Neural Networks and Learning Systems, 2022, PP:1-15. ISSN: 2162-237X

Author affiliations: [Huang, Gexin; Gu, Zheng Hui; Li, Yuanqing; Yu, Zhu Liang] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China; [Liu, Ke] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China; [Liang, Jiawen] South China Univ Technol, Sch Intelligent Engn, Guangzhou 510641, Peoples R China; [Cai, Chang] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Qi, Feifei] Guangdong Univ Finance, Sch Internet Finance & Informat Engn, Guangzhou 510521, Peoples R China.

Keywords: Training; Spatiotemporal phenomena; Electromagnetics; Deep learning; Convolution; Magnetic resonance imaging; Inverse problems; Convolutional encoder-decoder network (CedNet); data synthesis; deep learning; electromagnetic source imaging (ESI)

Abstract: Electromagnetic source imaging (ESI) requires solving a highly ill-posed inverse problem. To seek a unique solution, traditional ESI methods impose various forms of priors that may not accurately reflect the actual source properties, which may hinder their broad application. To overcome this limitation, in this article, a novel data-synthesized spatiotemporally convolutional encoder-decoder network (DST-CedNet) method is proposed for ESI. The DST-CedNet recasts ESI as a machine learning problem, where discriminative learning and latent-space representations are integrated in a CedNet to learn a robust mapping from the measured electroencephalography/magnetoencephalography (E/MEG) signals to the brain activity. In particular, by incorporating prior knowledge regarding dynamical brain activities, a novel data synthesis strategy is devised to generate large-scale samples for effectively training CedNet. This stands in contrast to traditional ESI methods, where the prior information is often enforced via constraints aimed primarily at mathematical convenience. Extensive numerical experiments, as well as analysis of a real MEG and epilepsy EEG dataset, demonstrate that the DST-CedNet outperforms several state-of-the-art ESI methods in robustly estimating source signals under a variety of source configurations.

Language: English

Multi-relational graph attention networks for knowledge graph completion

Authors: Li, Zhifei; Zhao, Yue; Zhang, Yan; Zhang, Zhaoli

Journal: Knowledge-Based Systems, 2022, 251:109262. ISSN: 0950-7051

Corresponding authors: Zhifei Li; Yan Zhang

Author affiliations: [Zhao, Yue; Li, Zhifei; Zhang, Yan] Hubei Univ, Sch Comp Sci & Informat Engn, Wuhan 430062, Peoples R China; [Zhang, Zhaoli] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China.

Corresponding institution: [Zhifei Li; Yan Zhang] School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China

Keywords: Multi-relational learning; Knowledge graph completion; Graph neural network; Attention mechanism

Abstract: Knowledge graphs are multi-relational data that contain massive numbers of entities and relations. As an effective graph representation technique based on deep learning, the graph neural network has reported outstanding performance for modeling knowledge graphs in recent studies. However, previous graph neural network-based models have not fully considered the heterogeneity of knowledge graphs. Furthermore, the attention mechanism has demonstrated great potential in many areas. In this paper, a novel heterogeneous graph neural network framework based on a hierarchical attention mechanism is proposed, including entity-level, relation-level, and self-level attention. Thus, the proposed model can selectively aggregate informative features and weight them adequately. The learned embeddings of entities and relations can then be used for downstream tasks. Extensive experimental results on various heterogeneous graph tasks demonstrate the superior performance of the proposed model compared with several state-of-the-art methods. © 2022 Elsevier B.V. All rights reserved.

Language: English

Learning Knowledge Graph Embedding With Heterogeneous Relation Attention Networks

Authors: Li, Zhifei; Liu, Hai*; Zhang, Zhaoli; Liu, Tingting; Xiong, Neal N.

Journal: IEEE Transactions on Neural Networks and Learning Systems, 2022, 33(8):3961-3973. ISSN: 2162-237X

Corresponding author: Liu, Hai

Author affiliations: [Zhang, Zhaoli; Li, Zhifei; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China; [Liu, Tingting] Hubei Univ, Sch Educ, Wuhan 430062, Peoples R China; [Xiong, Neal N.] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China; [Xiong, Neal N.] Northeastern State Univ, Dept Math & Comp Sci, Tahlequah, OK 74464 USA.

Corresponding institution: [Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Keywords: Semantics; Task analysis; Aggregates; Graph neural networks; Computer architecture; Learning systems; Fuses; Graph heterogeneity; graph neural networks (GNNs); knowledge graph (KG) embedding; KGs; link prediction

Abstract: Knowledge graph (KG) embedding aims to learn an embedding representation that retains the inherent structure of KGs. Graph neural networks (GNNs), as an effective graph representation technique, have shown impressive performance in learning graph embeddings. However, KGs are intrinsically heterogeneous: they contain various types of entities and relations. How to address complex graph data and aggregate multiple types of semantic information simultaneously is a critical issue. In this article, a novel heterogeneous GNN framework based on an attention mechanism is proposed. Specifically, the neighbor features of an entity are first aggregated under each relation-path. Then the importance of different relation-paths is learned through the relation features. Finally, the relation-path-based features are aggregated with the learned weight values to generate the embedding representation. Thus, the proposed method not only aggregates entity features from different semantic aspects but also allocates appropriate weights to them. This method can capture various types of semantic information and selectively aggregate informative features. The experimental results on three real-world KGs demonstrate superior performance compared with several state-of-the-art methods.

Language: English
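The three aggregation steps named in the abstract above (aggregate neighbors per relation-path, learn relation-path importance from relation features, combine with the learned weights) can be sketched as follows. This is an illustrative toy, not the paper's model: the mean aggregator, the dot-product scoring against a `query` vector (standing in for learned parameters), and all function names are assumptions made here.

```python
import math


def mean_vectors(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_entity(neighbors_by_relation, relation_features, query):
    """Return (embedding, attention weight per relation) for one entity.

    neighbors_by_relation: {relation: [neighbor feature vectors]}
    relation_features:     {relation: relation feature vector}
    query: hypothetical stand-in for learned attention parameters
    """
    relations = sorted(neighbors_by_relation)
    # Step 1: aggregate neighbor features under each relation-path.
    per_relation = [mean_vectors(neighbors_by_relation[r]) for r in relations]
    # Step 2: score each relation-path from its relation features.
    scores = [sum(q * f for q, f in zip(query, relation_features[r]))
              for r in relations]
    weights = softmax(scores)
    # Step 3: combine relation-path features using the attention weights.
    dim = len(per_relation[0])
    embedding = [sum(w * vec[i] for w, vec in zip(weights, per_relation))
                 for i in range(dim)]
    return embedding, dict(zip(relations, weights))


embedding, attention = aggregate_entity(
    {"born_in": [[1.0, 0.0], [3.0, 0.0]], "works_at": [[0.0, 2.0]]},
    {"born_in": [1.0, 0.0], "works_at": [0.0, 1.0]},
    query=[0.5, 1.5],
)
```

In a trained model the `query` vector and the per-relation features would be learned end to end; here they are fixed numbers chosen only to make the data flow visible.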

Multi-Scale Dynamic Convolutional Network for Knowledge Graph Embedding

Authors: Zhang, Zhaoli; Li, Zhifei; Liu, Hai; Xiong, Neal N.

Journal: IEEE Transactions on Knowledge and Data Engineering, 2022, 34(5):2335-2347. ISSN: 1041-4347

Corresponding author: Li, ZF

Author affiliations: [Li, Zhifei; Zhang, Zhaoli; Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Hubei, Peoples R China; [Xiong, Neal N.] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Hubei, Peoples R China.

Corresponding institution: [Li, ZF] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Hubei, Peoples R China.

Keywords: Computational modeling; Convolution; Semantics; Predictive models; Feature extraction; Knowledge engineering; Computer architecture; Knowledge graphs; knowledge graph embedding; complex relations; link prediction; convolutional network

Abstract: Knowledge graphs are large graph-structured knowledge bases with incomplete or partial information. Numerous studies have focused on knowledge graph embedding to identify the embedded representation of entities and relations, thereby predicting missing relations between entities. Previous embedding models primarily treat the (subject entity, relation, object entity) triplet as a translational distance or semantic matching in vector space. However, these models learn only a few expressive features and struggle to handle complex relations, i.e., 1-to-N, N-to-1, and N-to-N, in knowledge graphs. To overcome these issues, we introduce a multi-scale dynamic convolutional network (M-DCN) model for knowledge graph embedding. This model features top-notch performance and an ability to generate richer and more expressive feature embeddings than its counterparts. The subject entity and relation embeddings in M-DCN are composed in an alternating pattern in the input layer, which helps extract additional feature interactions and increase the expressiveness. Multi-scale filters are generated in the convolution layer to learn different characteristics among input embeddings. Specifically, the weights of these filters are dynamically related to each relation to model complex relations. The performance of M-DCN is tested experimentally on five benchmark datasets. Results show that the model can effectively handle complex relations and achieve state-of-the-art link prediction results on most evaluation metrics. © 1989-2012 IEEE.

Language: English

Factors associated with teachers' competence to develop students’ information literacy: A multilevel approach

Authors: Wu, Di; Zhou, Chi; Li, Yating; Chen, Min*

Journal: Computers & Education, 2022, 176:104360. ISSN: 0360-1315

Corresponding author: Chen, Min

Author affiliations: [Li, Yating; Zhou, Chi; Chen, Min; Wu, Di] Cent China Normal Univ, Natl Engn Res Ctr E Learning, 152 LuoYu St, Wuhan 430079, Peoples R China.

Corresponding institution: [Chen, Min] Cent China Normal Univ, Natl Engn Res Ctr E Learning, 152 LuoYu St, Wuhan 430079, Peoples R China.

Keywords: Elementary education; Information literacy; Pedagogical issues; Secondary education; Teacher professional development

Abstract: Cultivating students' information literacy is becoming increasingly important for teachers in the 21st century; however, teachers' competence to develop students' information literacy (TCDSIL) is far from satisfactory. To foster TCDSIL, its influencing factors and promotion strategies need to be studied. However, prior studies on TCDSIL mainly investigate the influencing factors from a single-level perspective and neglect the potential relationship between school context and TCDSIL. To fill this gap and provide a deeper understanding of the complex system of TCDSIL, this study surveyed 9909 teachers in 1286 primary and secondary schools and used a two-level hierarchical linear model to analyze the survey data. The results indicated that both teacher characteristics and school context have a significant relationship with TCDSIL. Among school-related factors, school type, resources for instruction, and network bandwidth have significant positive relationships with TCDSIL. Moreover, teachers' perceived usefulness, information processing skills (the skills of information access, information usage, and information management), and information ethics could predict TCDSIL. This study provides implications for improving TCDSIL, including paying attention to the gap between primary school teachers and secondary school teachers; enriching schools' digital teaching resources; ensuring school network quality; and enhancing teachers' perceived usefulness of ICT, information processing skills, and information ethics. © 2021 Elsevier Ltd

Language: English

Spatial Interference Alignment with Limited Precoding Matrix Feedback in a Wireless Multi-User Interference Channel for Smart Grids

Authors: Peng, Shixin; Chen, Xiaohui; Lu, Wei; Deng, Chao; Chen, Jingying

Journal: Energies, 2022, 15(5). ISSN: 1996-1073

Corresponding author: Chen, JY

Author affiliations: [Peng, Shixin; Chen, Jingying] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China; [Chen, Xiaohui] State Grid Hunan Elect Power Co Ltd, Informat & Commun Branch, Changsha 410004, Peoples R China; [Lu, Wei] Air Force Early Warning Acad, Wuhan 430019, Peoples R China; [Deng, Chao] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China.

Corresponding institution: [Chen, JY] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China.

Keywords: Interference alignment; Limited feedback; MIMO; Precoding matrix; Smart grid; Spatial multiplexing gain

Abstract: Cellular communication provides an efficient, flexible, long-lived, and reliable communication technology for smart grids to improve automated analysis, demand response, adaptive control, and coordination between the generator and consumers. With the expansion of wireless networks and the increase in access devices, interference has become a major problem that limits the performance of cellular wireless communication systems for smart grids. Spatial interference alignment (IA) is an effective method to eliminate interference and improve the capacity of wireless communication networks. This paper provides sufficient conditions for spatial interference alignment operating with limited precoding matrix feedback for a K-user MIMO interference channel. Each receiver feeds the matrix index of the transmitting precoder back to the corresponding transmitter through an interference-free and error-free link. We calculate the number of feedback bits required to achieve the maximum theoretical multiplexing gain for the spatial interference alignment schemes considered and demonstrate the feasibility of spatial interference alignment under the limited feedback constraint investigated. It is shown that, to maintain the same spatial multiplexing gain as the idealized scheme relying on perfect channel state information, the number of feedback bits per receiver scales as N_d ≥ d_i (M − d_i) log_2 SNR, where M and d_i denote the number of transmit (receive) antennas and the number of data streams for user i. Finally, the analytical results are verified by simulations for practical interference alignment schemes relying on limited precoding matrix feedback indices. © 2022 by the authors. Licensee MDPI, Basel, Switzerland.

Language: English
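To make the feedback scaling law in the abstract above concrete, the short sketch below evaluates the right-hand side d_i (M − d_i) log_2 SNR for given parameters. It rests on assumptions not stated in the abstract: SNR is treated as a linear power ratio converted from decibels, constant factors hidden by the scaling law are ignored, and the function name `feedback_bits_bound` is hypothetical.

```python
import math


def feedback_bits_bound(num_antennas, num_streams, snr_db):
    """Evaluate d * (M - d) * log2(SNR) for one receiver.

    num_antennas: M, number of transmit (receive) antennas
    num_streams:  d, number of data streams for the user
    snr_db:       SNR in decibels (assumed; converted to a linear ratio)
    """
    if not 0 <= num_streams <= num_antennas:
        raise ValueError("num_streams must lie between 0 and num_antennas")
    snr_linear = 10.0 ** (snr_db / 10.0)
    return num_streams * (num_antennas - num_streams) * math.log2(snr_linear)


# Example: M = 4 antennas, d = 2 streams, SNR = 30 dB.
bound = feedback_bits_bound(4, 2, 30.0)
print(math.ceil(bound))  # prints 40
```

Under these assumptions the bound grows linearly in SNR expressed in decibels, so each additional 3 dB costs roughly d (M − d) more feedback bits per receiver.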

VMV-GCN: Volumetric Multi-View Based Graph CNN for Event Stream Classification

Authors: Xie, Bochen; Deng, Yongjian; Shao, Zhanpeng; Liu, Hai; Li, Youfu

Journal: IEEE Robotics and Automation Letters, 2022, 7(2):1976-1983. ISSN: 2377-3766

Corresponding author: Li, YF

Author affiliations: [Li, Youfu; Xie, Bochen; Deng, Yongjian] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China; [Shao, Zhanpeng] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China; [Liu, Hai] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China.

Corresponding institution: [Li, YF] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China.

Keywords: Brightness; Cameras; Complexity theory; Feature extraction; Representation learning; Streaming media; Task analysis

Abstract: Event cameras perceive pixel-level brightness changes and output asynchronous event streams, offering notable advantages in temporal resolution, dynamic range, and power consumption for challenging vision tasks. To apply existing learning models to event data, many researchers integrate sparse events into dense frame-based representations that work directly with convolutional neural networks. Although these works achieve high performance on event-based classification, their models need many parameters to process dense event frames, which do not fit the sparsity of event data. To utilize the sparse nature of events, we propose a voxel-wise graph learning model (VMV-GCN) for spatio-temporal feature learning on event streams. Specifically, we design the volumetric multi-view fusion module (VMVF) to extract spatial and temporal information from views of voxelized event data. Then we take representative event voxels as vertices and use a novel dual-graph construction strategy to connect them. By aggregating neighborhood information based on relationships between vertices, the proposed dynamic neighborhood feature learning module (DNFL) can capture discriminative spatio-temporal features on dynamically updated graphs. Experiments show that our method achieves state-of-the-art performance with low model complexity on event-based classification tasks, such as object classification and action recognition. © 2022 IEEE.

Language: English
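The voxelization step mentioned in the abstract above can be illustrated with a small sketch. It is not the VMV-GCN pipeline: event polarity is ignored, the voxel sizes are arbitrary, and picking "representative" voxels as the `k` with the most events is an assumption made here because the abstract does not state the selection rule. The names `voxelize` and `representative_voxels` are hypothetical.

```python
from collections import Counter


def voxelize(events, voxel_size):
    """Count events per spatio-temporal voxel.

    events:     iterable of (x, y, t) tuples
    voxel_size: (size_x, size_y, size_t) in the same units as the events
    Returns a Counter mapping voxel index (ix, iy, it) to event count.
    """
    size_x, size_y, size_t = voxel_size
    counts = Counter()
    for x, y, t in events:
        index = (int(x // size_x), int(y // size_y), int(t // size_t))
        counts[index] += 1
    return counts


def representative_voxels(counts, k):
    """Pick the k voxels holding the most events (assumed criterion)."""
    return [voxel for voxel, _ in counts.most_common(k)]


events = [(0, 0, 0.0), (1, 1, 0.5), (9, 9, 0.2), (10, 10, 1.5)]
counts = voxelize(events, voxel_size=(8, 8, 1.0))
vertices = representative_voxels(counts, k=2)
```

Only non-empty voxels are stored, which keeps the representation sparse; the selected voxels would then serve as graph vertices in a model of the kind the abstract describes.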
