IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
HMM Speech Recognition Using Fusion of Visual and Auditory Information
Akio OgiharaAkira ShintaniNaoshi DoiKunio Fukunaga
Author information
Keywords: HMM
JOURNAL FREE ACCESS

1995 Volume 115 Issue 11 Pages 1317-1324

Details
Abstract
In the field of speech recognition, many researchers have proposed speech recognition methods using auditory information like acoustic signal or visual information like shape and motion of lips. Auditory information has valid features for speech recognition. but it is difficult to accomplish speech recognition in noisy environment. On the other side. visual information has advantage to accomplish speech recognition in noisy environment, but it is difficult to extract effective features for speech recognition. Thus, in case of using either auditory information or visual informaion. it is difficult to accomplish speech recognition perfectly.
In this paper. We propose a method to fuse auditory information and visual information in order to realize more accurate speech recognition. The proposed method consists of two processes: (1) two probabilities for auditory information and visual information are calculated by HMM, (2) these probabilities are fused by using linear combination. We have performed speech recognition experiments of isolated words and have confirmed the validity of the proposed method.
Content from these authors
© The Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top