JoyAgent-JDGenie: GAIAに関する技術レポート

要旨

大規模言語モデルは、複雑な現実世界のタスクに対する自律エージェントとしてますます展開されていますが、既存のシステムはしばしば孤立した改善に焦点を当て、堅牢性と適応性を統合した設計を欠いています。我々は、3つのコアコンポーネントを統合した汎用エージェントアーキテクチャを提案します。それは、計画と実行エージェントを批評モデルの投票と組み合わせた集団マルチエージェントフレームワーク、作業、意味、手続きの各層にまたがる階層型メモリシステム、そして検索、コード実行、マルチモーダル解析のための洗練されたツールスイートです。包括的なベンチマークで評価された結果、我々のフレームワークはオープンソースのベースラインを一貫して上回り、プロプライエタリシステムの性能に迫りました。これらの結果は、システムレベルの統合の重要性を示し、多様な領域やタスクにわたって動作可能なスケーラブルで回復力があり適応性の高いAIアシスタントへの道筋を強調しています。

English

Large Language Models are increasingly deployed as autonomous agents for complex real-world tasks, yet existing systems often focus on isolated improvements without a unifying design for robustness and adaptability. We propose a generalist agent architecture that integrates three core components: a collective multi-agent framework combining planning and execution agents with critic model voting, a hierarchical memory system spanning working, semantic, and procedural layers, and a refined tool suite for search, code execution, and multimodal parsing. Evaluated on a comprehensive benchmark, our framework consistently outperforms open-source baselines and approaches the performance of proprietary systems. These results demonstrate the importance of system-level integration and highlight a path toward scalable, resilient, and adaptive AI assistants capable of operating across diverse domains and tasks.

JoyAgent-JDGenie: GAIAに関する技術レポート

JoyAgent-JDGenie: Technical Report on the GAIA

要旨

Support