CGアバター対話における音声からの頭部動作および表情の自動生成

藤岡 侑貴; 上乃 聖; 李 晃伸

doi:10.11517/pjsai.JSAI2023.0_4Xin129

37th (2023)

Session ID : 4Xin1-29

DOI https://doi.org/10.11517/pjsai.JSAI2023.0_4Xin129

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 37

Location : [in Japanese]

Date : June 06, 2023 - June 09, 2023

Automatic generation of head motion and facial animation from speech in CG avatar dialogue

*Yuki FUJIOKA, Sei UENO, Akinobu LEE

Author information

Keywords: motion generation, CG Avatar, multi-modal interaction

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

In recent years, commnication through avatars has become popular and been expected to apply applications. However, operating the avatar can be burdensome as it requires not only speech but also the use of face, head, and hand motions simultaneously. To reduce the burden on the operator, we propose Speech2motion, a model that automatically generates CG avatar motion from speech. In this work, we focus on the motions in conversation, and the Speech2motion model uses LSTM-based neural networks to predict head motion and facial animation. We recorded 70 minites of motion data along with the speech of one speaker during conversation. We then trained the Speech2motion model using the recorded data. Experimental evaluation shows our proposed model achieves a mean opinion score (MOS) of 3.07 in naturalness of generating the motions.

Corresponding author

Conference information

Register with J-STAGE for free!