Acoustical Science and Technology
Online ISSN : 1347-5177
Print ISSN : 1346-3969
ISSN-L : 0369-4232
Volume 43, Issue 4
PAPER
  • Tomio Takara, Ryoichi Eto
    2022, Volume 43, Issue 4, pp. 219-227
    Published: 2022/07/01
    Released: 2022/07/01
    Journal, free access

    We propose a new speech analysis and synthesis system that uses a genetic algorithm (GA) for the analysis and Fujisaki's generative model of speech (the Fujisaki model) for the synthesis. The system is a functional model that simulates human acquisition of speech through the imitation of spoken words. We represent the coarticulation effect using the Fujisaki model, and we model the trial-and-error, emergent process of speech imitation using the GA. In our system, we regard a "command" in the Fujisaki model as an articulatory gesture and detect it from the spectral sequence using the GA. In other words, the original phonemic target is automatically estimated, by inversion, as the command in the Fujisaki model from a speech spectrum that coarticulation has made phonemically ambiguous. We evaluated the system through listening tests on synthesized speech. By comparing the predicted sound with the inversely estimated sound, we also show that the system can represent the phenomenon of "predicted sound," a type of flash-lag effect that is heard unconsciously as a result of the normalization of coarticulation.
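
The idea of inversely estimating commands with a GA can be illustrated with a toy sketch. It is not the authors' implementation. It assumes a single scalar trajectory standing in for one spectral parameter, rectangular commands with known onset and offset times, and the critically damped second-order step response familiar from the Fujisaki F0 model. The constant BETA, the function names, the GA operators (elitist selection, uniform crossover, Gaussian mutation), and all numeric settings are hypothetical choices made for illustration only.

```python
import math
import random

BETA = 20.0  # assumed rate constant (1/s) of the smoothing response


def step_response(t, beta=BETA):
    """Critically damped second-order step response; zero before the onset."""
    if t < 0.0:
        return 0.0
    return 1.0 - (1.0 + beta * t) * math.exp(-beta * t)


def synthesize(commands, times, beta=BETA):
    """Sum the smoothed responses to rectangular commands (onset, offset, amplitude)."""
    return [
        sum(a * (step_response(t - on, beta) - step_response(t - off, beta))
            for on, off, a in commands)
        for t in times
    ]


def fitness(amplitudes, bounds, times, observed):
    """Negative squared error between synthesized and observed trajectories."""
    commands = [(on, off, a) for (on, off), a in zip(bounds, amplitudes)]
    predicted = synthesize(commands, times)
    return -sum((p - o) ** 2 for p, o in zip(predicted, observed))


def estimate_commands(bounds, times, observed,
                      pop_size=30, generations=150, seed=0):
    """Toy GA over command amplitudes (hypothetical settings)."""
    rng = random.Random(seed)
    n = len(bounds)
    population = [[rng.uniform(-2.0, 2.0) for _ in range(n)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Elitist selection: the best quarter survives unchanged.
        population.sort(key=lambda ind: fitness(ind, bounds, times, observed),
                        reverse=True)
        parents = population[: pop_size // 4]
        children = []
        while len(parents) + len(children) < pop_size:
            mother, father = rng.sample(parents, 2)
            # Uniform crossover followed by Gaussian mutation.
            child = [rng.choice(pair) for pair in zip(mother, father)]
            child = [g + rng.gauss(0.0, 0.05) for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=lambda ind: fitness(ind, bounds, times, observed))


# Synthetic demonstration: blur three known targets, then recover them.
times = [i * 0.01 for i in range(60)]            # 0.6 s in 10 ms steps
bounds = [(0.0, 0.2), (0.2, 0.4), (0.4, 0.6)]    # assumed command timings
true_amps = [1.0, -0.5, 0.8]                     # made-up target values
observed = synthesize(
    [(on, off, a) for (on, off), a in zip(bounds, true_amps)], times)
estimated = estimate_commands(bounds, times, observed)
```

In this sketch the observed trajectory never settles exactly on a target value, because each command's response overlaps its neighbors, which loosely mirrors the phonemic ambiguity caused by coarticulation. The GA searches for the command amplitudes whose smoothed sum best matches the observation. With fixed timings this is a linear least-squares problem that could be solved directly; the GA is kept only to mirror the trial-and-error search described in the abstract.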

ACOUSTICAL LETTERS