Identifying the Correspondence between Comic Images and Dialogue Based on Distributed Representations

寺内 光; 森 直樹; 上野 未貴

doi:10.11517/pjsai.JSAI2020.0_3D5OS22b01

Abstract

Understanding human creations such as comics, novels, and music with artificial intelligence (AI) has become an attractive research topic. However, creating an interesting story or comic remains difficult for AI because it requires a great deal of human creativity. In this study, we examine whether AI can understand comics, using four-scene comics because they have a clear structure and format. Many studies have applied image models or natural language models to such tasks, but few have combined image and natural language features as multimodal data. We propose a deep learning method that combines images and language to understand four-scene comics. The effectiveness of the proposed method is confirmed by computer simulations on koma (panel) prediction problems for four-scene comics.
