Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
35th (2021)
Session ID : 2G3-GS-2e-05

Human Motion Forecasting Using GPT-2
*Kazuki MIYAZAWA, Teruya INOUE, Takayuki NAGAI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

Language models such as GPT-2 and BERT have improved performance on language understanding and language generation tasks. Recent work has begun to show that these models apply not only to language but also to non-linguistic data such as images and audio. Discretizing continuous data with VQ-VAE makes it possible to handle such data with language models in the same way as language data. We believe that this combination of discretization and learning over discrete sequences with a language model can be applied to various types of data. The purpose of this study is to verify that human motion data can be modeled using VQ-VAE and GPT-2. In our experiments, we trained VQ-VAE and GPT-2 on the CMU-mocap and 3DPW mocap datasets. We validated the learned models by forecasting future motion from a few current frames of input motion.
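
The discretization step described in the abstract can be illustrated with a minimal sketch of vector quantization, the nearest-codebook lookup at the core of VQ-VAE. This is not the authors' implementation: the codebook values, the 2-D "pose features", and the function names `quantize` and `dequantize` are all assumptions made for illustration. In an actual VQ-VAE the codebook is learned and the lookup is applied to encoder outputs rather than to raw values.

```python
import math

def quantize(frames, codebook):
    """Map each continuous frame vector to the index of its nearest codebook vector."""
    tokens = []
    for frame in frames:
        # Euclidean distance from this frame to every codebook entry
        distances = [math.dist(frame, code) for code in codebook]
        tokens.append(distances.index(min(distances)))
    return tokens

def dequantize(tokens, codebook):
    """Map token indices back to their codebook vectors (a lossy reconstruction)."""
    return [codebook[t] for t in tokens]

# Hypothetical hand-written codebook; a real VQ-VAE learns these vectors.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]

# Hypothetical continuous features, one 2-D vector per motion frame.
frames = [(0.1, -0.1), (0.9, 0.2), (0.2, 0.8), (0.8, 0.1)]

tokens = quantize(frames, codebook)
print(tokens)  # [0, 1, 2, 1]
print(dequantize(tokens, codebook))
```

The resulting integer sequence plays the same role as word tokens in text, so an autoregressive language model such as GPT-2 could be trained to predict the next token, and the predicted tokens could then be decoded back into motion.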

© 2021 The Japanese Society for Artificial Intelligence