共同注意を用いた複数物体環境下におけるロボットの語意学習

西村 卓真; 長野 匡隼; 中村 友昭

doi:10.11517/pjsai.JSAI2021.0_2J4GS8c03

35th (2021)

Session ID : 2J4-GS-8c-03

DOI https://doi.org/10.11517/pjsai.JSAI2021.0_2J4GS8c03

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 35

Location : [in Japanese]

Date : June 08, 2021 - June 11, 2021

Acquisition of Word Meaning for Robot Using Joint Attention in a Cluttered Scene

*Takuma NISHIMURA, Masatoshi NAGANO, Tomoaki NAKAMURA

Author information

Keywords: acquisition of word meaning, joint attention, multimodal latent Dirichlet allocation, region proposal network, unsupervised learning

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

Humans learn the names of objects by associating words to objects. It has been reported that joint attention, which is an ability to identify the target object, facilitates the acquisition of word meaning. We believe that this ability is also important for robots to flexibly acquire new words in the daily environment through interaction with humans. In this paper, we propose an algorithm that enables robots to learn word meanings in a cluttered scene by identifying the target object utilizing joint attention and co-occurrence of words and objects. In the proposed algorithm, a robot detects multiple objects using a region proposal network and selects one of them based on joint attention and the co-occurrence of words and objects. Finally, the robot acquires the word meaning by associating the word to the selected object by multimodal latent Dirichlet allocation.

Corresponding author

Conference information

Register with J-STAGE for free!