Host: The Japanese Society for Artificial Intelligence
Name: 34th Annual Conference, 2020
Number: 34
Location: Online
Date: June 09, 2020 - June 12, 2020
Variational Autoencoder (VAE) training suffers from posterior collapse, in which the decoder learns to ignore the latent variables. In this paper, we argue that posterior collapse is induced by the I-unidentifiable data generating process assumed by several existing VAEs: in such a process, the information that a particular latent variable is designed to capture can be acquired by other latent variables without any sacrifice in log-likelihood. We show that this perspective gives a unified explanation of posterior collapse, using a VAE with an autoregressive decoder and the disentangled sequential autoencoder as examples. In addition, we propose maximizing conditional mutual information with adversarial training to alleviate the unidentifiability, which requires no specific constraints on model architectures or latent variable structures. Empirically, our method mitigated posterior collapse in the above two models and improved the rate-distortion curve.
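To make the proposed objective concrete, below is a minimal PyTorch sketch, not the authors' implementation: it estimates a mutual-information lower bound with a learned critic (a MINE-style Donsker-Varadhan bound, one possible choice of adversarial estimator) and adds it to a VAE loss. The names Critic, mi_lower_bound, and mi_weight, as well as all layer sizes, are illustrative assumptions; for a conditional mutual information term, the conditioning variable would additionally be fed to the critic.

import math

import torch
import torch.nn as nn

class Critic(nn.Module):
    """Scores (z, t) pairs; trained to tell joint samples from shuffled marginals."""
    def __init__(self, z_dim, t_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(z_dim + t_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, z, t):
        return self.net(torch.cat([z, t], dim=-1))

def mi_lower_bound(critic, z, t):
    """Donsker-Varadhan bound: E_joint[T] - log E_marginal[exp(T)].
    Marginal samples are formed by shuffling t within the batch."""
    joint = critic(z, t).mean()
    t_shuffled = t[torch.randperm(t.size(0))]
    scores = critic(z, t_shuffled).squeeze(-1)
    marginal = torch.logsumexp(scores, dim=0) - math.log(t.size(0))
    return joint - marginal

# Training-step outline (encoder, decoder, and elbo() are assumed to exist):
#   z = q(z|x).rsample()
#   vae_loss = -elbo(x, z) - mi_weight * mi_lower_bound(critic, z, target)
# while the critic is updated by a separate optimizer to maximize the bound,
# giving the adversarial estimation described in the abstract.

if __name__ == "__main__":
    torch.manual_seed(0)
    batch, z_dim, t_dim = 64, 16, 32
    critic = Critic(z_dim, t_dim)
    z, t = torch.randn(batch, z_dim), torch.randn(batch, t_dim)
    print(mi_lower_bound(critic, z, t).item())  # near 0 for independent inputs

In this sketch, maximizing the bound with respect to z for a chosen target encourages that latent variable to retain the designated information, which is the intended counter to the unidentifiability described above.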