The KanNon System - vowel recognition using time-delay neural networks

K. Nakamuro; K. Haruki; S. Sugimoto

doi:10.5687/sss.2006.131

抄録

We develop the real-time speech visualization system called “KanNon”[1, 2] which supports speech communication of deaf people. The KanNon system presents several information of the speech such as loudness, pitch, sound spectrogram and characters by speech recognition system in real-time. In the present system, we are adapting a word unit speech recognition system using large-scale dictionary. However the KanNon system is required quick and simple display of speech contents for smooth communication. For this purpose, we apply phonemic speech recognition system for Japanese 5 vowels using “Time-Delay Neural Network (TDNN)”. Further, we developed speech detection, voiced/unvoiced (v/uv) detection and change detection algorithms in the KanNon system. Finally, we show experimental results using real speech data.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

トリフルオロメチル不斉構造を持つ強誘電性液晶の合成とその性質
Auxin-Regulated Gene

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）