Journal of Signal Processing
Online ISSN : 1880-1013
Print ISSN : 1342-6230
ISSN-L : 1342-6230
Improvement in Bone-Conducted Speech Restoration Using Linear Prediction and Long Short-Term Memory Model
Huy Quoc NguyenMasashi Unoki
著者情報
ジャーナル フリー

2020 年 24 巻 4 号 p. 175-178

詳細
抄録

Bone-conducted (BC) speech has a significant advantage as a solution for speech communication in an extremely noisy environment because of its stability against surrounding noise. However, the quality and intelligibility of BC speech degrade, making BC speech difficult to restore. To solve this problem, we propose a method for restoring BC speech with a combination of a linear prediction (LP) model using line spectral frequencies (LSFs) and a long short-term memory (LSTM) model. We evaluated the method using three objective measurements: log-spectrum distortion, LP coefficient distance, and a perceptual evaluation of speech quality. The results of all three measurements show that our method is better than the previous method, which used a simple recurrent network. These results also show that the model can yield speech with better quality when the LP gain is estimated more accurately.

著者関連情報
© 2020 Research Institute of Signal Processing, Japan
前の記事 次の記事
feedback
Top