Journal of Signal Processing
Online ISSN : 1880-1013
Print ISSN : 1342-6230
ISSN-L : 1342-6230
Combination of SPLICE and Feature Normalization for Noise Robust Speech Recognition
Tsunenobu KaiMasayuki SuzukiKeigo ChijiiwaNobuaki MinematsuKeikichi Hirose
Author information
JOURNAL FREE ACCESS

2012 Volume 16 Issue 4 Pages 323-326

Details
Abstract
It is well known that the performance of automatic speech recognition (ASR) systems is easily affected by acoustic mismatch between training and testing conditions. This mismatch is often caused by various kinds of environmental noise or distortion. To reduce the effect of mismatch, feature normalization, feature enhancement and model adaptation have been studied intensively. Cepstral mean normalization (CMN), mean and variance normalization (MVN), and histogram equalization (HEQ) are well-known methods of feature normalization. Stereo-based piecewise linear compensation for environments (SPLICE) is one of the feature enhancement methods. In this paper, we describe how to combine these methods to effectively improve the robustness of ASR systems. In the experiments performed on the Aurora-2 database, a good combination showed a 41% improvement in word error rate over SPLICE only, and a 25% improvement over the conventional combination of SPLICE and CMN.
Content from these authors
© 2012 Research Institute of Signal Processing, Japan
Previous article Next article
feedback
Top