Journal of the Acoustical Society of Japan (E)
Online ISSN : 2185-3509
Print ISSN : 0388-2861
ISSN-L : 0388-2861
Japanese digits recognition by neural networks using vocal tract shapes
Hiroshi KinugasaHiroyuki KamataYoshihisa Ishida
著者情報
ジャーナル フリー

1993 年 14 巻 2 号 p. 55-62

詳細
抄録
This paper presents a new system for spoken Japanese digits recognition by a neural network using vocal tract shapes. The vocal tract shape is a suitable parameter for synthesis or recognition. The vocal tract shapes are used for the neural network as input data. We first propose a simple method by which the vocal tract shape is directly estimated from speech waves. A three-layered neural network is used in our recogni tionsystem. The network learning algorithms utilized here are conjugate gradient (CG) algorithm and backpropagation (BP) algorithm. Finally, we show the recognition results to prove the effectiveness of our method, and we show that the CG algorithm has several advantages compared to the BP algorithm.
著者関連情報
© The Acoustical Society of Japan
次の記事
feedback
Top