Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 4O2-GS-8-05
Conference information

Text-Based Goal Definition in an Affordance-Driven Robot Action Planning System
*Mami KUROISHISolvi ARNOLDTadashi ADACHIKimitoshi YAMAZAKI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

For robots to operate effectively in the real world, the ability to autonomously generate action plans is indispensable. This requires reasoning through future states and prediction of action effects at fine granularity. For use in human support scenarios, it is further important that planning goals can be provided in the form of human language. In previous research, we proposed a robot action planning system that generates action plans for goals specified at run time [Arnold 2023]. However, goals had to be specified as image patches depicting the goal condition, which is impractical in practice. Here, we extend the action planning system so that goals can be specified in text format, by integrating a modified CLIP [Radford 2021] model into the planning system. During plan generation, the system evaluates plans by predicting plan outcomes in image form and comparing these predictions to the goal text using CLIP. This paper focuses on evaluation of CLIP’s potential for relating images and text in our task domain. We also conduct a verification experiment of the integrated planning system and show that the system can generate plans based on goals specified in text format.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top