版权说明 操作指南
首页 > 成果 > 详情

Unsupervised learning of visual and semantic features for video summarization

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
会议论文
作者:
Huang, Yansen;Zhong, Rui*;Yao, Wenjin;Wang, Rui
通讯作者:
Zhong, Rui
作者机构:
[Zhong, Rui; Huang, Yansen; Yao, Wenjin; Wang, Rui] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.
通讯机构:
[Zhong, Rui] C
Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.
语种:
英文
关键词:
Video Summarization;Attention model;Bi-LSTM
期刊:
Proceedings - IEEE International Symposium on Circuits and Systems
ISSN:
0271-4310
年:
2021
会议名称:
IEEE International Symposium on Circuits and Systems (IEEE ISCAS)
会议论文集名称:
IEEE International Symposium on Circuits and Systems
会议时间:
MAY 22-28, 2021
会议地点:
Daegu, SOUTH KOREA
会议主办单位:
[Huang, Yansen;Zhong, Rui;Yao, Wenjin;Wang, Rui] Cent China Normal Univ, Sch Comp, Wuhan, Peoples R China.
会议赞助商:
IEEE
出版地:
345 E 47TH ST, NEW YORK, NY 10017 USA
出版者:
IEEE
ISBN:
978-1-7281-9201-7
基金类别:
self-determined research funds of CCNU from the colleges' basic research and operation of MOE; National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [62002130]; Hubei Province Technological Innovation Major Project [2019AAA049]
机构署名:
本校为第一且通讯机构
院系归属:
计算机学院
摘要:
The high redundancy among keyframes is a critical issue for the existing summarizing methods in dealing with user-created videos. To address the critical issue, we present an unsupervised learning method, Spatial Attention Model guided Bi-directional Long Short-term Memory network (Bi-LSTM), on the combination of visual and semantic features. As for the visual feature, we design a Salient-Area-Size-based spatial attention model on the observation that humans tend to focus on sizable and moving objects in videos. Moreover, the Bi-LSTM network is leveraged to exploit the semantic feature. Afterw...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com