IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Speech and Image Processing, Recognition>
On Improvement to Non-Reference Speech Intelligibility Estimation Accuracy for Reverberant Speech
Kazushi NakazawaKazuhiro Kondo
Author information
JOURNAL RESTRICTED ACCESS

2023 Volume 143 Issue 8 Pages 830-841

Details
Abstract

In this paper, we made improvements and evaluated our proposed model for non-reference speech intelligibility estimation on reverberant speech, attempting to improve the estimation accuracy significantly. Our proposed method consists of a DNN for speech enhancement and a separate DNN for intelligibility estimation. The latter uses features obtained from enhanced and degraded speech to estimate intelligibility. Although previous studies have effectively estimated intelligibility for speech degraded by additive noise using similar models, they did not consider the degradation of distortion caused by reverberation. They also did not quantify the effect of various speech enhancement DNN models, the structure of the intelligibility prediction DNN, and the selection of parameters during feature calculation on estimation accuracy. Accordingly, we compared two top-of-the-line speech enhancement DNN models and used their output to train intelligibility prediction DNNs for reverberant speech while also varying the parameters used in the feature calculation. Consequently, the linear correlation coefficient between subjective and estimated intelligibility came to 0.801 with the best combination.

Content from these authors
© 2023 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top