Abstract
This paper addresses the problem of a robot identifying, from a set of objects visible to both a human and the robot, the object that the human asks it to bring by voice. In such a request, the human uses an expression consisting of one or more attributes, such as color and object name. We propose a method that identifies the requested object using color and object names. The identification uses multimodal information from speech and images.