ChatPaper.aiChatPaper

OceanGym:水下智能体基准测试环境

OceanGym: A Benchmark Environment for Underwater Embodied Agents

September 30, 2025
作者: Yida Xue, Mingjun Mao, Xiangyuan Ru, Yuqi Zhu, Baochang Ren, Shuofei Qiao, Mengru Wang, Shumin Deng, Xinyu An, Ningyu Zhang, Ying Chen, Huajun Chen
cs.AI

摘要

我们推出OceanGym,这是首个面向海洋水下具身智能体的综合基准平台,旨在推动AI在最具挑战性的现实环境之一中的发展。与陆地或空中领域不同,水下环境带来了极端的感知与决策难题,包括低能见度、动态洋流等,使得智能体的有效部署异常困难。OceanGym囊括了八个真实任务领域,并构建了一个由多模态大语言模型(MLLMs)驱动的统一智能体框架,该框架集成了感知、记忆与序列决策能力。智能体需理解光学与声呐数据,在复杂环境中自主探索,并在这些严苛条件下完成长期目标。大量实验表明,当前最先进的MLLM驱动智能体与人类专家之间仍存在显著差距,凸显了海洋水下环境中感知、规划及适应性的持续挑战。通过提供高保真、精心设计的平台,OceanGym为开发鲁棒的具身AI及将这些能力迁移至现实世界的自主海洋水下航行器建立了试验场,标志着向能够在地球最后未探索疆域之一中运作的智能体迈出了决定性的一步。代码与数据可在https://github.com/OceanGPT/OceanGym获取。
English
We introduce OceanGym, the first comprehensive benchmark for ocean underwater embodied agents, designed to advance AI in one of the most demanding real-world environments. Unlike terrestrial or aerial domains, underwater settings present extreme perceptual and decision-making challenges, including low visibility, dynamic ocean currents, making effective agent deployment exceptionally difficult. OceanGym encompasses eight realistic task domains and a unified agent framework driven by Multi-modal Large Language Models (MLLMs), which integrates perception, memory, and sequential decision-making. Agents are required to comprehend optical and sonar data, autonomously explore complex environments, and accomplish long-horizon objectives under these harsh conditions. Extensive experiments reveal substantial gaps between state-of-the-art MLLM-driven agents and human experts, highlighting the persistent difficulty of perception, planning, and adaptability in ocean underwater environments. By providing a high-fidelity, rigorously designed platform, OceanGym establishes a testbed for developing robust embodied AI and transferring these capabilities to real-world autonomous ocean underwater vehicles, marking a decisive step toward intelligent agents capable of operating in one of Earth's last unexplored frontiers. The code and data are available at https://github.com/OceanGPT/OceanGym.
PDF221October 1, 2025