Host: The Japanese Society for Artificial Intelligence
Name : The 36th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 36
Location : [in Japanese]
Date : June 14, 2022 - June 17, 2022
In these days, generated speech by the modern TTS system can be undistinguishable with real human's speech, and many researches have been studied even on emotionally conditioned TTS. Here we explore another way to control emotions on speech synthesis by combining facial expression data to achieve intuitive conditioning. In this paper, we share our experimental results and discuss the details.