音声と騒音の密度比推定を用いた音声区間検出法

太刀岡 勇気; 花沢 利行; 成田 知宏; 石井 純

doi:10.1541/ieejeiss.133.1549

抄録

In this paper, we propose a robust voice activity detection (VAD) method that uses a density ratio model. For VAD under highly noisy environments, the likelihood ratio test (LRT) is effective. Conventional LRT constructs speech and noise models, calculates the likelihood of each model, and takes the ratio of those likelihoods to detect speech. Although some improved LRT have been proposed, in conventional LRT, it has not been taken into account that the likelihood ratio of speech and noise model is required, not the likelihood of each model. The proposed method directly estimates the likelihood ratio without calculating each likelihood using an density ratio model obtained in advance by density ratio estimation procedure. Moreover, there is the problem of determining thresholds, which are used for VAD and significantly affect its performance. We propose a method that automatically determines thresholds using discriminant analysis. The experiments show that the proposed method is more effective than conventional methods especially under non-stationary noisy environments.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

【電気学会会員の方】購読している論文誌を無料でご覧いただけます（会員ご本人のみの個人としての利用に限ります）。購読者番号欄にMyページへのログインIDを，パスワード欄に生年月日8ケタ（西暦，半角数字。例：19800303）を入力して下さい。

ダウンロード

論文(PDF)の閲覧方法はこちら
閲覧方法 (389.7K)

前身誌

電気学会論文誌. C

電氣學會雜誌

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）