A Method for Normalizing the Frequency Characteristics of the Input System in the ETSI Standard Distributed Speech Recognition Front-End

柘植 覚; 黒岩 眞吾; 獅々堀 正幹; 北 研二

doi:10.1541/ieejeiss.125.120

Abstract

This paper evaluates the European Telecommunications Standards Institute (ETSI) standard Distributed Speech Recognition (DSR) front-end on continuous speech recognition with a Japanese speech corpus, and proposes Bias Removal Methods (BRMs) that reduce the distortion between feature parameters and the vector quantization (VQ) codebook. Experimental results show that (1) training the acoustic model on non-quantized features improves the recognition performance of DSR front-end features, and (2) broadening the analysis band improves recognition performance at the same bitrate. The proposed methods also improve recognition performance under DSR conditions. Notably, they yield an 18% relative improvement in error rate when the channel characteristics are mismatched.
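The abstract does not spell out how the BRMs work, so the following is only a minimal sketch of one generic way to reduce distortion between features and a VQ codebook, not the authors' algorithm. It assumes the channel mismatch appears as a constant additive offset on the feature vectors, estimates that offset as the mean residual between each feature and its nearest codeword, and subtracts it before quantization. The function names, the toy codebook, and the toy features are all hypothetical.

```python
# Hypothetical illustration: estimate a constant feature bias from VQ residuals
# and remove it. This is an assumed, generic scheme, not the paper's BRM.

def sq_dist(x, c):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, c))

def nearest_codeword(x, codebook):
    """Return the codeword closest to feature vector x."""
    return min(codebook, key=lambda c: sq_dist(x, c))

def mean_distortion(features, codebook):
    """Average squared quantization error over all feature vectors."""
    total = sum(sq_dist(x, nearest_codeword(x, codebook)) for x in features)
    return total / len(features)

def estimate_bias(features, codebook):
    """Mean residual (feature minus nearest codeword), per dimension."""
    dim = len(features[0])
    total = [0.0] * dim
    for x in features:
        c = nearest_codeword(x, codebook)
        for i in range(dim):
            total[i] += x[i] - c[i]
    return [t / len(features) for t in total]

def remove_bias(features, bias):
    """Subtract the estimated bias from every feature vector."""
    return [tuple(a - b for a, b in zip(x, bias)) for x in features]

# Toy data (assumed): a two-entry codebook and features shifted by (0.2, -0.1).
codebook = [(0.0, 0.0), (1.0, 1.0)]
features = [(0.2, -0.1), (1.2, 0.9), (0.2, -0.1)]

bias = estimate_bias(features, codebook)
corrected = remove_bias(features, bias)
before = mean_distortion(features, codebook)
after = mean_distortion(corrected, codebook)
print("estimated bias:", bias)
print("mean distortion before/after:", before, after)
```

In this toy case the estimated bias recovers the offset applied to the features, and the mean quantization distortion drops after subtraction. With real speech features, the nearest-codeword assignment can change once the bias is removed, so the estimate may need to be iterated.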
