Multimodal Sentiment Prediction Using VAE-Based Latent Utterance Topics and Semantics

立石 修平; 小瀬木 悠佳; 八島 浩文; 中辻 真

doi:10.11517/pjsai.JSAI2022.0_3P4GS201

36th (2022)

Session ID : 3P4-GS-2-01

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 36th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 36

Date : June 14, 2022 - June 17, 2022

Multimodal Semantic Prediction Utilizing Semantics and Latent Utterance Topics based on Variational Auto Encoder

*Shuhei TATEISHI, Yuka OZEKI, Hirofumi YASHIMA, Makoto NAKATSUJI

Keywords: AI, Multimodal, Sentiment Analysis, Natural Language Processing

CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

In multimodal machine learning, a persistent problem is how to combine multiple input sources so that the result is more accurate than simply aggregating the results of models trained on each input separately. To address this problem, we have developed a new model for multimodal sentiment analysis that is more accurate than existing models. It relies on three elements: (1) applying semantics to each word, (2) extracting relationships between modalities using attention, and (3) adding topic information, based on a latent space for the entire utterance, that unifies the modality information.
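
The abstract does not give implementation details, so the following is only a minimal sketch of how elements (2) and (3) might fit together, not the authors' model. It assumes scaled dot-product attention for the cross-modal step and a standard VAE reparameterization for the utterance-level latent vector. All names, dimensions, and the random stand-in features (`text`, `audio`, `w_mu`, `w_logvar`) are hypothetical, and element (1) is represented only by treating `text` as if it were semantic word embeddings.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention(query, context):
    # Scaled dot-product attention: each query row attends over context rows.
    scores = query @ context.T / np.sqrt(query.shape[-1])
    weights = softmax(scores, axis=-1)
    return weights @ context, weights

def sample_latent(utterance_vec, w_mu, w_logvar, rng):
    # VAE-style reparameterization: z = mu + sigma * eps, with eps ~ N(0, I).
    mu = utterance_vec @ w_mu
    logvar = utterance_vec @ w_logvar
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps, mu, logvar

rng = np.random.default_rng(0)
d, k = 8, 4                          # hypothetical feature and latent sizes
text = rng.standard_normal((5, d))   # 5 word vectors (stand-in for semantic embeddings)
audio = rng.standard_normal((7, d))  # 7 acoustic frames (stand-in features)

# (2) Relate the modalities: each word attends over the audio frames.
attended, weights = cross_modal_attention(text, audio)

# Pool word-level features into one utterance vector of size 2d.
utterance = np.concatenate([text, attended], axis=-1).mean(axis=0)

# (3) Sample an utterance-level latent vector from untrained, random projections.
w_mu = rng.standard_normal((2 * d, k)) * 0.1
w_logvar = rng.standard_normal((2 * d, k)) * 0.1
z, mu, logvar = sample_latent(utterance, w_mu, w_logvar, rng)

# Fused representation that a sentiment classifier could consume.
fused = np.concatenate([utterance, z])
```

In a trained model the projections would be learned jointly with a sentiment classifier and a VAE objective; here they are random, so the sketch only illustrates the data flow and tensor shapes.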
