IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Online ISSN : 1745-1337
Print ISSN : 0916-8508
Regular Section
Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement
Wei HANXiongwei ZHANGGang MINXingyu ZHOUMeng SUN
著者情報
ジャーナル 認証あり

2017 年 E100.A 巻 2 号 p. 714-717

詳細
抄録

In this letter, we explore joint optimization of perceptual gain function and deep neural networks (DNNs) for a single-channel speech enhancement task. A DNN architecture is proposed which incorporates the masking properties of the human auditory system to make the residual noise inaudible. This new DNN architecture directly trains a perceptual gain function which is used to estimate the magnitude spectrum of clean speech from noisy speech features. Experimental results demonstrate that the proposed speech enhancement approach can achieve significant improvements over the baselines when tested with TIMIT sentences corrupted by various types of noise, no matter whether the noise conditions are included in the training set or not.

著者関連情報
© 2017 The Institute of Electronics, Information and Communication Engineers
前の記事 次の記事
feedback
Top