映像情報メディア学会誌
Online ISSN : 1881-6908
Print ISSN : 1342-6907
ISSN-L : 1342-6907
論文
音声と画像の統合によるドライバの発話区間検出
二宮 芳樹坂 義秀前野 俊希根木 大輔宮島 千代美森 健策北坂 孝幸末永 康仁
著者情報
ジャーナル フリー

2008 年 62 巻 3 号 p. 435-441

詳細
抄録
Voice activity detection is an important part of the development of speech functions for on-board car navigation and assistance systems. It is difficult to detect voice activity using only sound information in a vehicle environment that has a wide variety of sounds and noises. We propose an suitable image feature and integration method that can be used to develop a robust bimodal voice activity detection (VAD) systems using a driver's voice and facial images. We select the normal correlation value between sequential mouth images and the number of low-intensity pixels in mouth image, which we then used as the feature for VAD. We propose a system in which the discrimination function consist of the sum of weighted singles feature discrimination functions and combinations of logical addition and multiplication of singles feature discrimination functions. The experimental results show that the proposed sound and image features can be useful and that the proposed integration method has a 97% hit rate, which is 9 points better than the previous integration method at the point that false alarm rate is about 12%.
著者関連情報
© 2008 一般社団法人 映像情報メディア学会
前の記事 次の記事
feedback
Top