人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
原著論文
話し方種別情報を含むテキスト対話を活用した表現豊かなテキスト音声合成
本間 幸徳金川 裕紀小林 のぞみ井島 勇祐齋藤 邦子
著者情報
ジャーナル フリー

2023 年 38 巻 3 号 p. F-MA7_1-12

詳細
抄録

This paper aims to generate expressive speech for integration with a robot and AI character dialogue systems. To generate expressive speech, some researchers have proposed using labels that express specific dialogue acts and emotions (i.e., speaking style information). Our approach is to use the speaking style information as an intermediate representation and to train a model for inferring the speaking style information from the text and a speech synthesis model independently. Using a model that infers speaking style information from text, we construct a method that can generate expressive speech for text in the dialogue domain, outside the scope of speech synthesis training. The method first estimates the labels corresponding to the speaking style information for the input text. Then, the estimated labels and the input text are used to generate speech using a speech synthesis model. Experiments show that our method effectively improves the accuracy of text classification of speaking style labels. Subjective evaluation experiments show that our method can produce more expressive speech than conventional methods.

著者関連情報
© 人工知能学会2023
前の記事 次の記事
feedback
Top