基于时频联合损失函数的语音增强算法

首页 > 成果 > 详情

认领

导出

Link by 中国知网学术期刊 Link by 万方学术期刊

反馈

作者信息关键词期刊信息基础信息归属信息摘要

成果类型：

期刊论文

论文标题(英文)：

Speech enhancement algorithm based on time-frequency joint loss function

作者：

高戈;王霄;曾邦;尹文兵;陈怡

作者机构：

[高戈; 王霄; 尹文兵; 曾邦] 国家多媒体软件工程技术研究中心(武汉大学),武汉430072

华中师范大学计算机学院,武汉430079

[陈怡] 华中师范大学

语种：

中文

关键词：

时域语音增强;联合损失函数;语音通信;语音识别;深度学习

关键词(英文)：

time domain speech enhancement;joint loss function;speech communication;speech recognition;deep learning

期刊：

计算机应用

ISSN：

1001-9081

年：

2022

卷：

期：

S01

页码：

316-320

DOI：

10.11772/j.issn.1001-9081.2021071325

机构署名：

本校为其他机构

院系归属：

计算机学院

摘要：

在频域语音增强算法中，由于估计幅度谱与带噪相位谱的不匹配，其性能难以突破固有上限。在时域语音增强框架中，模型将时域波形作为输入，由网络直接学习时域波形之间的映射关系，有效地避免了无效短时傅里叶变换（STFT）问题。然而，常见的采用波形最小均方误差的时域语音增强算法对语音频域特征的建模并未达到最优。针对这个问题，提出一种基于时频联合损失函数的语音增强算法。首先将时频联合损失函数应用到Wave-U-Net时域语音增强网络，同时设计并分析了一阶范数形式和二阶范数形式的时频联合损失函数对增强网络的影响，最后得到了面向语音通信任务和语音识别任务的相对最佳损失函数选择方...

摘要(英文)：

In the frequency-domain speech enhancement algorithms,the performance is difficult to break the inherent upper limit due to the mismatch between the estimated amplitude spectrum and the band-noise phase spectrum.In the time-domain speech enhancement framework,time-domain waveform is taken as the input of the model and the mapping relationship between time-domain waveforms is learned directly by the network,which effectively avoids the invalid Short-Time Fourier Transform(STFT)problem.However,the common time-domain speech enhancement algorithm using waveform minimum mean square error does not a...

反馈

产权有误：本人成果被他人认领

数据有误：数据基本信息有误

归属有误：成果的院系归属、机构署名归属有误

其他原因：

验证码：

看不清楚，换一个

确定

取消

成果认领

标题：

用户	作者	通讯作者	--
	请选择	请选择	--

确定

取消

基于时频联合损失函数的语音增强算法

反馈

成果认领

提示

该栏目需要登录且有访问权限才可以访问