JoyAgent-JDGenie: GAIA에 대한 기술 보고서

초록

대규모 언어 모델은 복잡한 현실 세계의 과제를 위해 점점 더 자율 에이전트로 배포되고 있지만, 기존 시스템은 견고성과 적응성을 위한 통합 설계 없이 개별적인 개선에 초점을 맞추는 경우가 많습니다. 우리는 세 가지 핵심 구성 요소를 통합한 범용 에이전트 아키텍처를 제안합니다: 계획 및 실행 에이전트와 비평 모델 투표를 결합한 집단적 다중 에이전트 프레임워크, 작업, 의미, 절차적 계층을 아우르는 계층적 메모리 시스템, 그리고 검색, 코드 실행, 다중 모드 파싱을 위한 정교한 도구 세트입니다. 포괄적인 벤치마크에서 평가한 결과, 우리의 프레임워크는 오픈소스 기준선을 꾸준히 능가하며 독점 시스템의 성능에 근접했습니다. 이러한 결과는 시스템 수준의 통합의 중요성을 보여주며, 다양한 도메인과 작업에 걸쳐 운영할 수 있는 확장성, 회복력, 적응성을 갖춘 AI 어시스턴트로 나아가는 길을 제시합니다.

English

Large Language Models are increasingly deployed as autonomous agents for complex real-world tasks, yet existing systems often focus on isolated improvements without a unifying design for robustness and adaptability. We propose a generalist agent architecture that integrates three core components: a collective multi-agent framework combining planning and execution agents with critic model voting, a hierarchical memory system spanning working, semantic, and procedural layers, and a refined tool suite for search, code execution, and multimodal parsing. Evaluated on a comprehensive benchmark, our framework consistently outperforms open-source baselines and approaches the performance of proprietary systems. These results demonstrate the importance of system-level integration and highlight a path toward scalable, resilient, and adaptive AI assistants capable of operating across diverse domains and tasks.

JoyAgent-JDGenie: GAIA에 대한 기술 보고서

JoyAgent-JDGenie: Technical Report on the GAIA

초록

Support