HVLM: Exploring Human-Like Visual Cognition and Language-Memory Network for Visual Dialog

首页 > 成果 > 详情

认领

导出

Link by DOI

反馈

作者信息关键词期刊信息基础信息归属信息摘要

成果类型：

期刊论文

作者：

Sun, Kaili;Guo, Chi;Zhang, Huyin;Li, Yuan

通讯作者：

Chi Guo<&wdkj&>Huyin Zhang

作者机构：

[Sun, Kaili; Zhang, Huyin] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China.

[Guo, Chi] Wuhan Univ, Inst Artificial Intelligence, Sch Comp Sci, Wuhan 430072, Peoples R China.

[Guo, Chi] Wuhan Univ, Luojia Lab, Wuhan 430072, Peoples R China.

[Li, Yuan] Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.

通讯机构：

[Chi Guo] I

[Huyin Zhang] S

School of Computer Science, Wuhan University, Wuhan 430072, China<&wdkj&>Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China<&wdkj&>Luojia Laboratory, Wuhan University, Wuhan 430072, China

语种：

英文

关键词：

Dual-perspective reasoning;Simple spectral graph convolution network;Visual Dialog;Visual-language understanding

期刊：

Information Processing & Management

ISSN：

0306-4573

年：

2022

卷：

期：

页码：

103008

DOI：

10.1016/j.ipm.2022.103008

基金类别：

This work was supported in part by The National Key Research and Development Program of China under Grant no. 2018YFB1305001 , The Major Science and Technology Project of Hubei Province under Grant no. 2021AAA010 , and The National Social Science Fund of China under Grant no. 18BYY174 .

机构署名：

本校为其他机构

院系归属：

计算机学院

摘要：

Visual dialog, a visual-language task, enables an AI agent to engage in conversation with humans grounded in a given image. To generate appropriate answers for a series of questions in the dialog, the agent is required to understand the comprehensive visual content of an image and the fine-grained textual context of the dialog. However, previous studies typically utilized the object-level visual feature to represent a whole image, which only focuses on the local perspective of an image but ignores the importance of the global information in an ...

反馈

产权有误：本人成果被他人认领

数据有误：数据基本信息有误

归属有误：成果的院系归属、机构署名归属有误

其他原因：

验证码：

看不清楚，换一个

确定

取消

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

HVLM: Exploring Human-Like Visual Cognition and Language-Memory Network for Visual Dialog

反馈

成果认领

提示

该栏目需要登录且有访问权限才可以访问