反復償却推論によるマルチモーダル情報統合の改善

大島 佑太; 鈴木 雅大; 松尾 豊

doi:10.11517/pjsai.JSAI2023.0_2G1OS21c03

Abstract

Multimodal variational autoencoders can acquire a latent representation that integrates information from all modalities by learning an inference model. However, when we want to obtain the shared representation from an arbitrary modality, other modality inputs are missing, which prevents proper inference of the representation. In this study, we reconsider the missing modality problem as part of the amortization gap between amortization inference from any modality and multimodal ELBO, and propose a method to appropriately obtain a shared representation from a single modality input by using iterative amortized inference. However, since multimodal ELBO must be evaluated in the process of iterative amortized inference, missing modality inputs are also required. We, therefore, prepare an inference model that takes only the modality to be inferred as input, distill iterative amortized inference as the teacher and the newly prepared inference model as the student, and verify that an inference model that can acquire a shared representation from a single modality is obtained.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!