Host: The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
Over the past decade, deep learning has made significant strides across various domains by training large models with large-scale computational resources. Recent studies have shown that large-scale Transformer models perform well on diverse generative tasks, including language modeling and image modeling. Efficient training of such large-scale models requires vast amounts of data, and many fields are working on building large-scale datasets. However, despite the development of simulator environments such as CARLA and large-scale datasets such as RoboNet, how the performance of world models, which aim to acquire the temporal and spatial structure of environments, scales with dataset size has yet to be sufficiently studied. This work therefore experimentally demonstrates a scaling law of world-model performance with respect to dataset size. We use VideoGPT and a dataset generated by the CARLA simulator. We also show that, when the number of model parameters is on the order of 10^7 or larger and the computational budget is limited, the budget should mainly be spent on scaling up the dataset size.
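To make the notion of a dataset-size scaling law concrete, the sketch below fits a power law L(D) = c * D^(-alpha) to loss-versus-dataset-size points by linear regression in log-log space. The dataset sizes, losses, and the exponent value are synthetic placeholders for illustration, not numbers reported in this work; the power-law functional form follows the general scaling-law literature.

```python
import numpy as np

# Hypothetical dataset sizes (e.g., number of frames) and validation
# losses lying on an assumed power-law curve; these are illustrative
# values, not results from the paper.
D = np.array([1e4, 3e4, 1e5, 3e5, 1e6])
loss = 2.5 * (1e3 / D) ** 0.08

# A power law L(D) = c * D**(-alpha) is linear in log-log space, so a
# degree-1 polynomial fit recovers the exponent (slope = -alpha).
slope, intercept = np.polyfit(np.log(D), np.log(loss), 1)
alpha = -slope
print(f"fitted scaling exponent alpha = {alpha:.3f}")
```

Plotting measured losses against dataset size on log-log axes and checking for such a straight-line trend is the standard way scaling behavior like the one studied here is visualized.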