版权说明 操作指南
首页 > 成果 > 详情

Dynamic interactive learning network for audio-visual event localization

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Chen, Jincai;Liang, Han;Wang, Ruili;Zeng, Jiangfeng;Lu, Ping
通讯作者:
Zeng, JF
作者机构:
[Liang, Han; Chen, Jincai] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan, Peoples R China.
[Chen, Jincai; Lu, Ping] Huazhong Univ Sci & Technol, Inst Nat & Math Sci, Wuhan, Peoples R China.
[Wang, Ruili; Liang, Han] Massey Univ, Inst Nat & Math Sci, Auckland, New Zealand.
[Zeng, Jiangfeng] Cent China Normal Univ, Sch Informat Management, Wuhan, Peoples R China.
[Zeng, Jiangfeng] Ctr Data Governance & Intelligent Decis Making Hub, Wuhan, Peoples R China.
通讯机构:
[Zeng, JF ] C
Cent China Normal Univ, Sch Informat Management, Wuhan, Peoples R China.
Ctr Data Governance & Intelligent Decis Making Hub, Wuhan, Peoples R China.
语种:
英文
关键词:
Audio-visual event localization;Dynamic fusion;Attention mechanism;Difference loss
期刊:
Applied Intelligence
ISSN:
0924-669X
年:
2023
卷:
53
期:
24
页码:
30431-30442
基金类别:
This work was supported by the National Natural Science Foundation of China under Grant No. 62102159, No. 62272178 and the Humanities and Social Science Fund of Ministry of Education of China under Grant No. 21YJC870002 and the Fundamental Research Funds for the Central Universities under Grant No. CCNU22QN017 and Knowledge Innovation Program of Wuhan-Shuguang Project under Grant No. 2022010801020287 and the Natural Science Foundation of Hubei Province under grant No. 2023AFB1018. The authors gratelfully acknowledge financial support from China Scholarship Council (CSC).
机构署名:
本校为通讯机构
院系归属:
信息管理学院
摘要:
Audio-visual event (AVE) localization aims to detect whether an event exists in each video segment and predict its category. Only when the event is audible and visible can it be recognized as an AVE. However, sometimes the information from auditory and visual modalities is asymmetrical in a video sequence, leading to incorrect predictions. To address this challenge, we introduce a dynamic interactive learning network designed to dynamically explore the intra- and inter-modal relationships depending on the other modality for better AVE localizat...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com