Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications
Online ISSN : 2188-4749
Print ISSN : 2188-4730
第37回ISCIE「確率システム理論と応用」国際シンポジウム(2005年10月, 大阪茨木)
The KanNon System - vowel recognition using time-delay neural networks
K. NakamuroK. HarukiS. Sugimoto
著者情報
ジャーナル フリー

2006 年 2006 巻 p. 131-136

詳細
抄録
We develop the real-time speech visualization system called “KanNon”[1, 2] which supports speech communication of deaf people. The KanNon system presents several information of the speech such as loudness, pitch, sound spectrogram and characters by speech recognition system in real-time. In the present system, we are adapting a word unit speech recognition system using large-scale dictionary. However the KanNon system is required quick and simple display of speech contents for smooth communication. For this purpose, we apply phonemic speech recognition system for Japanese 5 vowels using “Time-Delay Neural Network (TDNN)”. Further, we developed speech detection, voiced/unvoiced (v/uv) detection and change detection algorithms in the KanNon system. Finally, we show experimental results using real speech data.
著者関連情報
© 2006 ISCIE Symposium on Stochastic Systems Theory and Its Applications
前の記事 次の記事
feedback
Top