IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Regular Section
Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis
June Sig SUNGDoo Hwa HONGHyun Woo KOONam Soo KIM
著者情報
ジャーナル フリー

2013 年 E96.D 巻 2 号 p. 379-382

詳細
抄録
In our previous study, we proposed the waveform interpolation (WI) approach to model the excitation signals for hidden Markov model (HMM)-based speech synthesis. This letter presents several techniques to improve excitation modeling within the WI framework. We propose both the time domain and frequency domain zero padding techniques to reduce the spectral distortion inherent in the synthesized excitation signal. Furthermore, we apply non-negative matrix factorization (NMF) to obtain a low-dimensional representation of the excitation signals. From a number of experiments, including a subjective listening test, the proposed method has been found to enhance the performance of the conventional excitation modeling techniques.
著者関連情報
© 2013 The Institute of Electronics, Information and Communication Engineers
前の記事 次の記事
feedback
Top