版权说明 操作指南
首页 > 成果 > 详情

A question-guided multi-hop reasoning graph network for visual question answering

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Xu, Zhaoyang;Gu, Jinguang;Liu, Maofu;Zhou, Guangyou;Fu, Haidong;...
通讯作者:
Chen Qiu
作者机构:
[Gu, Jinguang; Qiu, Chen; Xu, Zhaoyang; Liu, Maofu; Fu, Haidong] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430081, Peoples R China.
[Gu, Jinguang; Qiu, Chen; Xu, Zhaoyang; Liu, Maofu; Fu, Haidong] Wuhan Univ Sci & Technol, Inst Big Data Sci & Engn, Wuhan 430081, Peoples R China.
[Zhou, Guangyou] Cent China Normal Univ, Sch Comp Sci, Wuhan 430079, Peoples R China.
通讯机构:
[Chen Qiu] S
School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, 430081, China<&wdkj&>Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan, 430081, China
语种:
英文
关键词:
Multi-hop reasoning;Reasoning graph network;Visual question answering
期刊:
Information Processing & Management
ISSN:
0306-4573
年:
2023
卷:
60
期:
2
页码:
103207
基金类别:
This work was supported by the National Natural Science Foundation of China under Grants 61972173 , the Joint Funds of the National Natural Science Foundation of China under Grants U1836118 , and the Fundamental Research Funds for the Central Universities (No. CCNU22QN015 ). We thank Emanuele Bugliarello for the constructive feedback regarding the VOLTA framework implementation. We are grateful to the anonymous reviewers and the associate editor for their insightful comments.
机构署名:
本校为其他机构
院系归属:
计算机学院
摘要:
Visual Question Answering (VQA) requires reasoning about the visually-grounded relations in the image and question context. A crucial aspect of solving complex questions is reliable multi-hop reasoning, i.e., dynamically learning the interplay between visual entities in each step. In this paper, we investigate the potential of the reasoning graph network on multi-hop reasoning questions, especially over 3 “hops.” We call this model QMRGT: A Question-Guided Multi-hop Reasoning Graph Network. It constructs a cross-modal interaction module (CIM)...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com