AWorld: 에이전트 중심 AI를 위한 훈련 레시피 구성

초록

실습을 통한 학습 패러다임은 능력 있는 에이전트 AI 시스템 개발에 있어 핵심적이지만, 비효율적인 경험 생성으로 인해 심각한 제약을 받고 있으며, 이는 특히 GAIA와 같은 복잡한 벤치마크에서 두드러집니다. 이를 해결하기 위해 우리는 대규모 에이전트-환경 상호작용을 위해 설계된 오픈소스 시스템인 AWorld를 소개합니다. AWorld는 클러스터에 작업을 분산시켜 표준 단일 노드 순차 실행 대비 경험 수집 속도를 14.6배 가속합니다. 이 중요한 속도 향상은 광범위한 강화 학습을 실용적이고 확장 가능하게 만듭니다. 이 기능을 활용하여, 우리는 Qwen3-32B 기반 에이전트를 훈련시켰으며, 이는 기본 모델을 크게 능가하여 GAIA 정확도를 21.59%에서 32.23%로 증가시켰습니다. 벤치마크의 가장 어려운 수준에서 우리의 에이전트는 16.33%의 점수를 달성하며, 주요 독점 모델의 성능을 뛰어넘었습니다. 우리의 오픈소스 시스템과 그 결과물인 에이전트는 효율적인 상호작용부터 입증된 모델 개선까지 완전한 에이전트 AI 훈련 파이프라인을 위한 실용적인 청사진을 제공합니다.

English

The learning from practice paradigm is crucial for developing capable Agentic AI systems, yet it is severely hampered by inefficient experience generation, a bottleneck especially pronounced in complex benchmarks like GAIA. To address this, we introduce AWorld, an open-source system engineered for large-scale agent-environment interaction. By distributing tasks across a cluster, AWorld accelerates experience collection by 14.6x compared to standard single-node, sequential execution. This critical speedup makes extensive reinforcement learning practical and scalable. Leveraging this capability, we trained a Qwen3-32B-based agent that significantly outperforms its base model, increasing its overall GAIA accuracy from 21.59% to 32.23%. On the benchmark's most challenging levels, our agent achieves a score of 16.33%, surpassing the performance of leading proprietary models. Our open-source system and resulting agent provide a practical blueprint for a complete agentic AI training pipeline, from efficient interaction to demonstrable model improvement.

AWorld: 에이전트 중심 AI를 위한 훈련 레시피 구성

AWorld: Orchestrating the Training Recipe for Agentic AI

초록

Support