Journal of Signal Processing
Online ISSN : 1880-1013
Print ISSN : 1342-6230
ISSN-L : 1342-6230
Improvement in Bone-Conducted Speech Restoration Using Linear Prediction and Long Short-Term Memory Model
Huy Quoc NguyenMasashi Unoki
Author information
JOURNAL FREE ACCESS

2020 Volume 24 Issue 4 Pages 175-178

Details
Abstract

Bone-conducted (BC) speech has a significant advantage as a solution for speech communication in an extremely noisy environment because of its stability against surrounding noise. However, the quality and intelligibility of BC speech degrade, making BC speech difficult to restore. To solve this problem, we propose a method for restoring BC speech with a combination of a linear prediction (LP) model using line spectral frequencies (LSFs) and a long short-term memory (LSTM) model. We evaluated the method using three objective measurements: log-spectrum distortion, LP coefficient distance, and a perceptual evaluation of speech quality. The results of all three measurements show that our method is better than the previous method, which used a simple recurrent network. These results also show that the model can yield speech with better quality when the LP gain is estimated more accurately.

Content from these authors
© 2020 Research Institute of Signal Processing, Japan
Previous article Next article
feedback
Top