Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 1G5-OS-21b-03
Conference information

World models using latent diffusion model
Eitaro YAMATSUTA*Fumiya UCHIYAMAReiya SEKIDOYuto KAWAHARAMasahiro SUZUKIYutaka MATSUO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

World models model the external world from limited information and can be used to predict future external states and observations for learning. In spatio-temporal prediction, reinforcement learning methods using deep generative models have attracted attention. In generative models, Imagen and Stable-diffusion based on diffusion models are known for their high image generation capability. In this study, we propose a method to generate a better latent representation from the hidden states of LSTM by changing the vision part of World Models from conventional β-VAE to latent diffusion model, and compare these methods.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top