In the frequency-domain speech enhancement algorithms,the performance is difficult to break the inherent upper limit due to the mismatch between the estimated amplitude spectrum and the band-noise phase spectrum.In the time-domain speech enhancement framework,time-domain waveform is taken as the input of the model and the mapping relationship between time-domain waveforms is learned directly by the network,which effectively avoids the invalid Short-Time Fourier Transform(STFT)problem.However,the common time-domain speech enhancement algorithm using waveform minimum mean square error does not a...