主催: (社)計測自動制御学会システムインテグレーション部門
p. 117
Speech-to-speech translation has been studied to realize natural human communication beyond language barriers. Toward further multi-modal natural communication, visual information such as face and lip movements will be necessary. In this paper, we introduce a multi-modal English-to-Japanese and Japanese-to-English translation system that also translates the speaker’s speech motion while synchronizing it to the translated speech.