潜在拡散モデルを用いた世界モデルの提案

山蔦 栄太郎; 内山 史也; 関戸 麗矢; 川原 雄登; 鈴木 雅大; 松尾 豊

doi:10.11517/pjsai.JSAI2023.0_1G5OS21b03

37th (2023)

Session ID : 1G5-OS-21b-03

DOI https://doi.org/10.11517/pjsai.JSAI2023.0_1G5OS21b03

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 37

Location : [in Japanese]

Date : June 06, 2023 - June 09, 2023

World models using latent diffusion model

Eitaro YAMATSUTA, *Fumiya UCHIYAMA, Reiya SEKIDO, Yuto KAWAHARA, Masahiro SUZUKI, Yutaka MATSUO

Author information

Keywords: World models, Latent diffusion model, Rainforced Learning

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

World models model the external world from limited information and can be used to predict future external states and observations for learning. In spatio-temporal prediction, reinforcement learning methods using deep generative models have attracted attention. In generative models, Imagen and Stable-diffusion based on diffusion models are known for their high image generation capability. In this study, we propose a method to generate a better latent representation from the hidden states of LSTM by changing the vision part of World Models from conventional β-VAE to latent diffusion model, and compare these methods.

Corresponding author

Conference information

Register with J-STAGE for free!