Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
36th (2022)
Session ID : 3Yin2-23
Conference information

Image Captioning that Reflects the Intent of the Explainer based on Tracing with a Pen.
*Sayako WATANABEIchiro KOBAYASHI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In recent years, research on image caption generation has evolved to include not only the generation of image captions based on information obtained from image preprocessing, but also the generation of captions based on the user's interest in the image by providing additional information corresponding to the viewpoint, called control signals, to the image processing information. In this paper, we propose a new method to generate captions based on the user's interests. In general, when people describe the image, they usually use their fingers to trace the object they want to describe. In this study, we consider tracing the image as a control signal. And, we propose an interactive generating image caption method that is more in line with the explainer by reflecting the meaning of the traces.

Content from these authors
© 2022 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top