Author affiliations:
[Zhong, Duo; Jiang, Xingpeng; Li, Bojing] Cent China Normal Univ, Hubei Key Lab Artificial Intelligence & Smart Learning, Wuhan, Peoples R China.;[Zhong, Duo; Jiang, Xingpeng; Li, Bojing] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.;[Qiao, Jimei] Shanghai Normal Univ, Math & Sci Coll, Shanghai, Peoples R China.;[Jiang, Xingpeng] Cent China Normal Univ, Natl Language Resources Monitoring & Res Ctr Network Media, Wuhan, Peoples R China.
Corresponding author's affiliation:
[Xingpeng Jiang] Hubei Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, China; School of Computer, Central China Normal University, Wuhan, China; National Language Resources Monitoring & Research Center for Network Media, Central China Normal University, Wuhan, China
Abstract:
Microorganisms play important roles in our lives, especially in metabolism and diseases. Determining the probability that a person suffers from a specific disease, and the severity of that disease, based on microbial genes is crucial for understanding the relationship between microbes and diseases. Previous studies could extract the topological information of phylogenetic trees and integrate it into metagenomic datasets, enabling classifiers to learn more from limited data and thus improving model performance. In this paper, we propose GNPI, a model that better learns the structure of phylogenetic trees. GNPI maintains the original vector format of metagenomic datasets, whereas previous approaches had to convert the input into matrices. The vector form of the input data can be easily adopted by baseline machine learning models and is also applicable to deep learning models. Datasets processed with GNPI improve the accuracy of machine learning and deep learning models on three different datasets. GNPI is an interpretable data processing method for host phenotype prediction and other bioinformatics tasks.
Abstract:
In cross-language question retrieval (CLQR), users employ a new question in one language to search community question answering (CQA) archives for similar questions in another language. In addition to the ranking problem in monolingual question retrieval, one needs to bridge the language gap in CLQR. Existing adversarial models for cross-language learning normally rely on a single adversarial component. Since natural languages consist of units at different levels of abstraction, we argue that crossing the language gap adaptively at different levels with multiple adversarial components should lead to smoother text representations and better CLQR performance. To this end, we first encode questions into multi-layer representations at different levels of abstraction with a CNN-based model, which enhances conventional models with diverse kernel shapes and a corresponding pooling strategy so as to capture different aspects of a text segment. We then impose a set of adversarial components on different layers of the question representation so as to decide the appropriate abstraction levels and their roles in performing cross-language mapping. Experimental results on two real-world datasets demonstrate that our model outperforms state-of-the-art models for CLQR and is on par with strong machine translation baselines and most monolingual baselines. (C) 2020 Elsevier Inc. All rights reserved.
Abstract:
Dense motion estimates obtained from optical flow techniques play a significant role in many image processing and computer vision tasks. Remarkable progress has been made in both theory and practice. In this paper, we provide a systematic review of recent optical flow techniques with a focus on variational methods and approaches based on Convolutional Neural Networks (CNNs), the two categories that have led to state-of-the-art performance. We discuss recent modifications and extensions of the original variational model and highlight remaining challenges. For the first time, we provide an overview of recent CNN-based optical flow methods and discuss their potential and current limitations.
Abstract:
One category of neural information retrieval models tries to learn text representations in a common embedding space for both queries and documents. However, a single embedding space is not always sufficient, since queries and documents differ in length, number of topics covered, and so on. We argue that queries and documents should be mapped into different but overlapping embedding spaces, a scheme we name the Partially Shared Embedding Space (PSES) model. PSES consists of two embedding spaces, one each for queries and documents, and a shared embedding space capturing features common to both sources. The three embeddings are learned jointly under three constraints: a feature separation constraint, a pairwise matching constraint, and a reconstruction constraint. Experiments on standard TREC collections indicate that PSES yields significantly better retrieval performance than traditional IR models and several neural IR models with only one embedding space.
Journal:
Natural Language Engineering, 2018, 24(4):523-549. ISSN: 1351-3249
Corresponding author:
Li, Bo
Author affiliations:
[Li, Bo] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.;[Gaussier, Eric] Univ Grenoble Alpes, CNRS, LIG, AMA, Grenoble, France.;[Yang, Dan] China Elect Power Res Inst, Wuhan, Hubei, Peoples R China.
Corresponding author's affiliation:
[Li, Bo] Cent China Normal Univ, Dept Comp Sci, Wuhan, Hubei, Peoples R China.
Abstract:
Comparable corpora serve as an important substitute for parallel resources for under-resourced language pairs. Previous work mostly aims at better strategies for exploiting existing comparable corpora, while ignoring variation in corpus quality. The quality of a comparable corpus greatly affects its usability in practice, a fact that has been confirmed by several studies. However, researchers have not yet established a widely accepted and fully validated framework for measuring corpus quality. In this paper, we therefore investigate a comprehensive methodology for assessing the quality of comparable corpora. Specifically, we propose several comparability measures and a quantitative strategy for testing them. Our experiments show that the proposed comparability measure captures gold-standard comparability levels very well and is robust to the bilingual dictionary used. Moreover, we show on the task of bilingual lexicon extraction that the proposed measure correlates well with the performance of a real-world application.
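The abstract does not spell out the comparability measure itself; a common dictionary-based instantiation scores a corpus pair by the proportion of source words whose dictionary translations appear in the target corpus. The sketch below is purely illustrative: the function name, the toy data, and the one-directional form (the two directions can be averaged for symmetry) are assumptions, not the paper's exact definition.

```python
def comparability(source_vocab, target_vocab, dictionary):
    """Proportion of dictionary-covered source words whose translation
    occurs in the target corpus (illustrative, one direction only)."""
    translatable = [w for w in source_vocab if w in dictionary]
    if not translatable:
        return 0.0
    found = sum(1 for w in translatable
                if any(t in target_vocab for t in dictionary[w]))
    return found / len(translatable)

# Toy bilingual dictionary and vocabularies (hypothetical data):
# "maison" and "chat" have translations in the target, "arbre" does not.
dictionary = {"maison": ["house", "home"], "chat": ["cat"], "arbre": ["tree"]}
score = comparability({"maison", "chat", "arbre", "oov"},
                      {"house", "cat", "dog"}, dictionary)
```

Here `score` is 2/3: three source words are covered by the dictionary, and two of them have a translation in the target vocabulary; the out-of-dictionary word "oov" is ignored.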
Journal:
Information Processing & Management, 2018, 54(2):291-302. ISSN: 0306-4573
Corresponding author:
Li, Bo
Author affiliations:
[Li, Bo] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.;[Gaussier, Eric] Univ Grenoble Alpes, CNRS, LIG AMA, Grenoble, France.;[Yang, Dan] China Elect Power Res Inst, Wuhan, Hubei, Peoples R China.
Corresponding author's affiliation:
[Li, Bo] Cent China Normal Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China.
Keywords:
Cross-language information retrieval;D/C condition;Information retrieval heuristic
Abstract:
Centralized group key establishment protocols are the most commonly used type due to their efficiency in computation and communication. In such protocols, a key generation center (KGC) acts as a server that initially registers users. Since the KGC selects the group key for group communication, all users must trust the KGC. Needing a mutually trusted KGC can cause problems in some applications; for example, users in a social network cannot trust the network server to select a group key for secure group communication. In this paper, we remove the need for a mutually trusted KGC by assuming that each user trusts only himself. During registration, each user acts as a KGC to register the other users and issue sub-shares to them. By the secret sharing homomorphism, all sub-shares held by a user can be combined into a master share. The master shares enable a pairwise shared key between any pair of users, and a verification procedure allows all users to check that their master shares were generated consistently without revealing the master shares. In a group communication, the initiator can act as the server, selecting a group key and distributing it to each other user over a pairwise shared channel. Our design is unique in that the storage per user is minimal, the verification of master shares is efficient, and the group key distribution is centralized. Public-key based group key establishment protocols without a trusted third party exist, but they can only establish a single group key. Our protocol is a non-public-key solution that can establish multiple group keys and is computationally efficient.
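The secret sharing homomorphism underlying this construction can be demonstrated with additive combination of Shamir shares: when every user deals sub-shares of their own secret, the sum of the sub-shares each user holds is a valid share of the summed secret. The sketch below is only a demonstration of that algebraic property; the field size, threshold, and helper names are assumptions, not the paper's actual protocol.

```python
import random

P = 2**31 - 1  # prime modulus for the share arithmetic (illustrative choice)

def make_shares(secret, t, ids):
    """Shamir (t, n): random degree-(t-1) polynomial f with f(0) = secret;
    user i receives the sub-share f(i)."""
    coeffs = [secret] + [random.randrange(P) for _ in range(t - 1)]
    return {i: sum(c * pow(i, k, P) for k, c in enumerate(coeffs)) % P
            for i in ids}

def reconstruct(shares):
    """Lagrange interpolation at x = 0 over the prime field."""
    secret = 0
    for i, y in shares.items():
        num = den = 1
        for j in shares:
            if j != i:
                num = num * (-j) % P
                den = den * (i - j) % P
        secret = (secret + y * num * pow(den, P - 2, P)) % P
    return secret

# Each of three users acts as a dealer for their own secret; summing the
# sub-shares held by each user yields that user's master share, which is a
# share of the sum of all secrets -- no trusted KGC ever sees the secrets.
ids = [1, 2, 3]
secrets = [11, 22, 33]
dealt = [make_shares(s, t=2, ids=ids) for s in secrets]
master = {i: sum(d[i] for d in dealt) % P for i in ids}
```

Any threshold-sized subset of master shares now reconstructs the combined secret, e.g. `reconstruct({1: master[1], 2: master[2]})` equals `sum(secrets)`.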
Abstract:
Buzzwords are a main embodiment of internet culture and play an important role in public opinion analysis, social focus tracking, and the study of language evolution. At present, questionnaires are widely used as the standard method for obtaining network buzzwords, which is subjective and costly. In this paper, we propose a novel algorithm that relies on the time-distribution feature of words and a KL-divergence measure to estimate word popularity and thereby identify buzzwords in a specific period. The time-distribution feature simply states the fact that a buzzword's usage increases sharply during a very short period, which is then modeled formally with the KL-divergence measure. Compared with the traditional method, which involves substantial manual work, the automatic algorithm presented here is clearly more efficient. Moreover, buzzwords identified in this manner are not affected by individuals' subjective opinions, so they better reflect language usage in practice. When applying the algorithm to a social media big data set, our experimental results show that the proposed approach can accurately identify buzzwords in a given period, with results highly coincident with manually tagged ones.
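The time-distribution idea can be sketched in a few lines: bucket a word's usage counts over time, normalize them into a distribution, and score burstiness as the KL divergence from a background distribution. The uniform background and the function names below are illustrative assumptions, not the paper's exact formulation.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) for discrete distributions given as equal-length lists."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def popularity_score(counts, background):
    """Burstiness of a word: KL divergence between its empirical
    time distribution and a background distribution."""
    total = sum(counts)
    if total == 0:
        return 0.0
    p = [c / total for c in counts]
    return kl_divergence(p, background)

# A word whose usage spikes in one time bucket (buzzword-like behaviour)
# scores much higher than a word with flat usage.
weeks = 5
background = [1 / weeks] * weeks  # uniform background (an assumption here)
buzz = popularity_score([1, 2, 40, 3, 1], background)
flat = popularity_score([9, 10, 9, 10, 9], background)
```

Ranking words by this score over a fixed window and keeping the top of the list is then a direct way to surface candidate buzzwords for the period.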
Journal:
Lecture Notes in Computer Science, 2014, 8801:223-233. ISSN: 0302-9743
Corresponding author:
Li, Bo
Author affiliations:
[He, Tingting; Li, Bo; Chen, Qianjun; Zhu, Qunyan] Cent China Normal Univ, Hubei Univ, Sch Comp, Ctr Natl Language Tracing & Res Network, Natl Engn Res Ctr E Learning, Network Ctr, Wuhan 430079, Peoples R China.
Corresponding author's affiliation:
[Li, Bo] Cent China Normal Univ, Hubei Univ, Sch Comp, Ctr Natl Language Tracing & Res Network, Natl Engn Res Ctr E Learning, Network Ctr, Wuhan 430079, Peoples R China.
Conference:
13th China National Conference on Chinese Computational Linguistics (CCL) / 2nd International Symposium on Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD)
Conference date:
OCT 18-19, 2014
Conference venue:
Cent China Normal Univ, Wuhan, PEOPLES R CHINA
Journal:
Lecture Notes in Computer Science, 2014, 8444 LNAI (Part 2):134-145. ISSN: 0302-9743
Author affiliations:
[Luo, Jing; Tu, Xinhui; Gu, Jinguang] College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China;[Gu, Jinguang] State Key Lab. of Software Engineering, Wuhan University, Wuhan, China;[He, Tingting; Li, Bo] Department of Computer Science, Central China Normal University, Wuhan, China;[Luo, Jing; Tu, Xinhui] Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan, China
Journal:
International Conference on Information and Knowledge Management, Proceedings, 2013:1237-1240
Author affiliations:
[Luo, Jing; Tu, Xinhui; Liu, Maofu] College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, China;[Li, Bo; He, Tingting] Department of Computer Science, Central China Normal University, Wuhan, China;[Luo, Jing; Tu, Xinhui; Liu, Maofu] Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan, China
Conference:
Proceedings of the 22nd ACM International Conference on Information & Knowledge Management (CIKM 2013)
Abstract:
A main challenge in applying translation language models to information retrieval is how to estimate the "true" probability that a query could be generated as a translation of a document. State-of-the-art methods rely on document-based word co-occurrences to estimate word-word translation probabilities. However, these methods do not take into account the proximity of co-occurrences. Intuitively, proximity can be exploited to estimate more accurate translation probabilities, since two words that occur closer together are more likely to be related. In this paper, we study how to explicitly incorporate proximity information into the existing translation language model and propose a proximity-based translation language model, called TM-P, with three variants. In our TM-P models, a new concept, the proximity-based word co-occurrence frequency, is introduced to model the proximity of word co-occurrences, which is then used to estimate translation probabilities. Experimental results on standard TREC collections show that our TM-P models achieve significant improvements over state-of-the-art translation models. Copyright 2013 ACM.
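The proximity-based word co-occurrence frequency can be illustrated with a simple sketch: word pairs co-occurring within a window contribute weights that decay with their distance, and the accumulated weights are normalized into translation probabilities. The inverse-distance kernel, window size, and function names below are illustrative assumptions, not the exact TM-P definition or any of its three variants.

```python
from collections import defaultdict

def proximity_cooccurrence(docs, max_dist=5):
    """Proximity-based word co-occurrence frequency: each pair within
    max_dist positions contributes a weight decaying with distance
    (inverse-distance kernel assumed here)."""
    cooc = defaultdict(float)
    for doc in docs:
        for i, w in enumerate(doc):
            for j in range(i + 1, min(i + 1 + max_dist, len(doc))):
                weight = 1.0 / (j - i)  # closer pairs count more
                cooc[(w, doc[j])] += weight
                cooc[(doc[j], w)] += weight
    return cooc

def translation_probs(cooc):
    """Normalize accumulated co-occurrence weights into p(w | u)."""
    totals = defaultdict(float)
    for (u, _), c in cooc.items():
        totals[u] += c
    return {(u, w): c / totals[u] for (u, w), c in cooc.items()}

# Toy "collection" of one tokenized document (hypothetical data).
docs = [["query", "likelihood", "model", "query", "term"]]
probs = translation_probs(proximity_cooccurrence(docs))
```

In a translation language model, `probs[(u, w)]` would play the role of the word-word translation probability p(w | u); by construction the probabilities for each conditioning word u sum to one.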