Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 2G1-OS-21c-03
Conference information

Multimodal Information Integration with Iterative Amortized Inference
*Yuta OSHIMAMasahiro SUZUKIYutaka MATSUO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Multimodal variational autoencoders can acquire a latent representation that integrates information from all modalities by learning an inference model. However, when we want to obtain the shared representation from an arbitrary modality, other modality inputs are missing, which prevents proper inference of the representation. In this study, we reconsider the missing modality problem as part of the amortization gap between amortization inference from any modality and multimodal ELBO, and propose a method to appropriately obtain a shared representation from a single modality input by using iterative amortized inference. However, since multimodal ELBO must be evaluated in the process of iterative amortized inference, missing modality inputs are also required. We, therefore, prepare an inference model that takes only the modality to be inferred as input, distill iterative amortized inference as the teacher and the newly prepared inference model as the student, and verify that an inference model that can acquire a shared representation from a single modality is obtained.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top