画像を通じた同義文の潜在空間における対応関係の学習

孫 延君; 小林 一郎

doi:10.11517/pjsai.JSAI2021.0_4I1GS7b05

Abstract

In this study, we aim to investigate whether multimodal information can improve the understanding of uni-modal information by clarifying the relationship between the variables of each modality in the latent space. Here, we especially focus on two modalities: image and natural language, and have investigated whether a common image to synonymous sentences is useful for conversion between those two sentences through the latent space. As a result of the preliminary experiment, we confirmed that the accuracy and the efficiency of reconstructing the input sentence using the image whose content reflects that of the sentence is higher than the case without using such image.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!