日本神経回路学会誌
Online ISSN : 1883-0455
Print ISSN : 1340-766X
ISSN-L : 1340-766X
研究論文
砂時計型ニューラルネットワークによる日本語5母音の特徴をとらえた音声合成パラメータの抽出
清水 忠昭木本 雅也吉村 宏紀井須 尚紀菅田 一博
著者情報
ジャーナル フリー

2004 年 11 巻 4 号 p. 167-175

詳細
抄録

We showed a new scheme to characterize speech from LSP parameters by 5 layers sandglass type nonlinear neural network (SNN(NL5)). In order to synthesize speech, we take advantage of useful abilities of SNN(NL5) for compressing and restoring the information. We performed learning experiments on LSP parameters of 5 vowels to investigate the ability of SNN. The followings were verified, 1) the distribution of LSP parameters compressed by SNN(NL5) are similar to the distribution of F1-F2 formants plane. 2) Nonlinear output function of neural elements in second and fourth layers of SNN(NL5) work effectively from view point of separating the distribution of vowels. 3) In order to prevent SNN(NL5) from over learning, there exists the optimum numbers of neural elements in second and fourth layers. For 14 orders of LSP parameters, this number was determined to be 20. 4) There is a preferable property on the plane to separate the vowels distinctively when the restoring error of LSP parameters becomes less. 5) SNN(NL5) can restore the LSP parameters with accuracy enough to synthesize speech from the compressed parameters.

著者関連情報
© 2004 日本神経回路学会
前の記事 次の記事
feedback
Top