確率分布を用いた画像テキストデータの埋め込みと検索

濱 健太; 松原 崇; 上原 邦昭

doi:10.11517/pjsai.JSAI2018.0_3L205

32nd (2018)

Session ID : 3L2-05

DOI https://doi.org/10.11517/pjsai.JSAI2018.0_3L205

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 32nd Annual Conference of the Japanese Society for Artificial Intelligence, 2018

Number : 32

Location : [in Japanese]

Date : June 05, 2018 - June 08, 2018

Embedding and retrieval of images and text data using probability distribution

*Kenta HAMA, Takashi MATSUBARA, Kuniaki UEHARA

Author information

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

Multimodal data including images, sounds, texts is accumulated on the Internet. We can expect general-purpose data representation to perform tasks such as data discrimination, generation, and retrieval on various modalities datasets. The key idea for acquiring the representation is embedding a point from a data space of each modality in a point of common space. However, if data is embedded in a point, it becomes difficult to interpret the ambiguity of the data's meaning and the inclusive relation among the data. Of course, representation of data point does not necessarily need to be a point. In this study, we embed image and text into a normal distribution in a common space. This improves the performance of image retrieval.

Corresponding author

Conference information

Register with J-STAGE for free!