Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications
Online ISSN : 2188-4749
Print ISSN : 2188-4730
第40回ISCIE「確率システム理論と応用」国際シンポジウム(2008年11月, 京都)
The KanNon System - phonemic recognition using Burg-MCE
K. NomuraY. RiK. FujimotoS. Sugimoto
著者情報
ジャーナル フリー

2009 年 2009 巻 p. 306-311

詳細
抄録
We have developed a real-time speech visualization system called “KanNon”[1,2] which supports speech communication of hearing-impaired people. The KanNon system presents informations of the speech such as loudness, pitch, sound spectrogram and characters by speech recognition system in real-time. In the present KanNon system, a word-unit speech recongniton system using large scale dictionary is adopted. However, the KanNon system is required quick and simple display of speech contents for smooth communication. For this purpose, we applied phonemic speech recognition system. Also, we have already proposed Japanese 5 vowels (/a/, /i/, /u/, /e/, /o/) recognition methods, applying “Time-Delay Neural Network (TDNN)” [3] and statistical pattern recognition [4].However, correct recognition rate is about 85 percent shown in Tables 1, 2 which is not so high. In this paper, therefore, we attempt to obtain better spectral features for phenemic recognition, we apply the novel spectral estimation method called Burg-MCE[5] method combining Burg method and Minimum Cross Entropy method. We apply human auditory property to power spectrum estimated by Burg-MCE method, and carry out phonemic recognition by using statiscal pattern recognition.
著者関連情報
© 2009 ISCIE Symposium on Stochastic Systems Theory and Its Applications
前の記事 次の記事
feedback
Top