IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Speech and Image Processing, Recognition>
Speech Intelligibility Estimation using Support Vector Regression and Critical Band Segmental SNR in Noisy Condition
Yosuke KobayashiKazuhiro Kondo
Author information
JOURNAL FREE ACCESS

2013 Volume 133 Issue 8 Pages 1556-1564

Details
Abstract
In this paper, we propose a speech intelligibility estimation method using the Support Vector Regression (SVR) with the normalized segmental Signal-to-Noise Ratio in 25 critical bands (cbSNRseg). In the proposed method, estimation was done in the target 32 noise environments which were classified into 3 clusters by the ambient noise clustering method with MIR (Music Information Retrieval) features and the x-means algorithm. Next, We compared cbSNRseg and 1/3 octave bands SNRseg (obSNRseg) and used the cross-validation RMSE in 5 regression methods including SVR. As a result, the weighted sum of RMSE using cbSNRseg was better than obSNRseg with RMSE reduction factor of about 0.8 compared to all other regression methods. Finally, we compared the performance of each regression methods in open tests. As a result, the best regression method was the SVR using the RBF kernel, in which RMSE is reduced by a factor of about 0.7 compared to other regression methods.
Content from these authors
© 2013 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top