Creative Robot Tool Use with Large Language Models

October 19, 2023
Authors: Mengdi Xu, Peide Huang, Wenhao Yu, Shiqi Liu, Xilun Zhang, Yaru Niu, Tingnan Zhang, Fei Xia, Jie Tan, Ding Zhao
cs.AI

Abstract

Tool use is a hallmark of advanced intelligence, exemplified in both animal behavior and robotic capabilities. This paper investigates the feasibility of imbuing robots with the ability to creatively use tools in tasks that involve implicit physical constraints and long-term planning. Leveraging Large Language Models (LLMs), we develop RoboTool, a system that accepts natural language instructions and outputs executable code for controlling robots in both simulated and real-world environments. RoboTool incorporates four pivotal components: (i) an "Analyzer" that interprets natural language to discern key task-related concepts, (ii) a "Planner" that generates comprehensive strategies based on the language input and key concepts, (iii) a "Calculator" that computes parameters for each skill, and (iv) a "Coder" that translates these plans into executable Python code. Our results show that RoboTool can not only comprehend explicit or implicit physical constraints and environmental factors but also demonstrate creative tool use. Unlike traditional Task and Motion Planning (TAMP) methods that rely on explicit optimization, our LLM-based system offers a more flexible, efficient, and user-friendly solution for complex robotics tasks. Through extensive experiments, we validate that RoboTool is proficient in handling tasks that would otherwise be infeasible without the creative use of tools, thereby expanding the capabilities of robotic systems. Demos are available on our project page: https://creative-robotool.github.io/.
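
The four components described in the abstract can be read as a chain of prompting stages, each consuming the outputs of the stages before it. The following is a minimal sketch of that data flow only, not the authors' implementation: the function names, the prompt wording, the `echo_llm` stand-in, and the sample instruction are all hypothetical and chosen for illustration.

```python
from typing import Callable

# Hypothetical stand-in for an LLM call: takes a prompt, returns text.
LLM = Callable[[str], str]


def run_pipeline(instruction: str, llm: LLM) -> str:
    """Chain four stages; each stage sees the outputs produced before it."""
    # Analyzer: extract key task-related concepts from the instruction.
    concepts = llm(f"[Analyzer] List the key concepts.\nTask: {instruction}")
    # Planner: build a strategy from the instruction and the key concepts.
    plan = llm(
        f"[Planner] Write a step-by-step plan.\n"
        f"Task: {instruction}\nKey concepts: {concepts}"
    )
    # Calculator: compute parameters for each skill used in the plan.
    params = llm(f"[Calculator] Compute skill parameters.\nPlan: {plan}")
    # Coder: turn the plan and parameters into executable Python code.
    return llm(
        f"[Coder] Write Python code.\nPlan: {plan}\nParameters: {params}"
    )


def echo_llm(prompt: str) -> str:
    """Toy model (assumption): echoes the stage tag so the flow is traceable."""
    stage = prompt.split("]")[0][1:]
    return f"<{stage} output>"


# Hypothetical instruction, used only to exercise the pipeline.
result = run_pipeline("Move the object across the gap.", echo_llm)
print(result)
```

In practice `echo_llm` would be replaced by a call to a real language model, and the string returned by the final stage would be checked before being executed on a robot.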