AgentStore：異種エージェントのスケーラブルな統合を専門とする汎用コンピュータアシスタント

要旨

複雑なコンピュータタスクを自動化するデジタルエージェントは、人間とコンピュータの相互作用を向上させる潜在能力が大きなため、注目を集めています。しかし、既存のエージェント手法は、特に実世界の環境でのオープンエンドのコンピュータタスクの処理において、一般化能力と特化能力において不足が見られます。Appストアの豊富な機能性に着想を得て、私たちはAgentStoreを提案します。これは、コンピュータタスクを自動化するために異種エージェントを動的に統合するために設計されたスケーラブルなプラットフォームです。AgentStoreは、ユーザーがサードパーティのエージェントを統合できるようにし、システムが能力を持続的に充実させ、急速に進化するオペレーティングシステムに適応できるようにします。さらに、私たちは、多様なエージェントを効率的に管理し、それらの特化能力と一般化能力をドメイン固有のタスクとシステム全体のタスクの両方に活用するための新しい中核となるMetaAgentとAgentToken戦略を提案します。3つの難解なベンチマークでの包括的な実験は、AgentStoreが以前の能力の狭いシステムの制限を超え、特にOSWorldベンチマークで11.21％から23.85％に大幅な改善を達成し、以前の結果を倍以上にしたことを示しています。包括的な定量的および定性的結果は、AgentStoreが一般化と特化の両方でエージェントシステムを向上させる能力を示し、特化した一般化コンピュータアシスタントの開発の可能性を強調しています。私たちのすべてのコードは、https://chengyou-jia.github.io/AgentStore-Home で公開されます。

English

Digital agents capable of automating complex computer tasks have attracted considerable attention due to their immense potential to enhance human-computer interaction. However, existing agent methods exhibit deficiencies in their generalization and specialization capabilities, especially in handling open-ended computer tasks in real-world environments. Inspired by the rich functionality of the App store, we present AgentStore, a scalable platform designed to dynamically integrate heterogeneous agents for automating computer tasks. AgentStore empowers users to integrate third-party agents, allowing the system to continuously enrich its capabilities and adapt to rapidly evolving operating systems. Additionally, we propose a novel core MetaAgent with the AgentToken strategy to efficiently manage diverse agents and utilize their specialized and generalist abilities for both domain-specific and system-wide tasks. Extensive experiments on three challenging benchmarks demonstrate that AgentStore surpasses the limitations of previous systems with narrow capabilities, particularly achieving a significant improvement from 11.21\% to 23.85\% on the OSWorld benchmark, more than doubling the previous results. Comprehensive quantitative and qualitative results further demonstrate AgentStore's ability to enhance agent systems in both generalization and specialization, underscoring its potential for developing the specialized generalist computer assistant. All our codes will be made publicly available in https://chengyou-jia.github.io/AgentStore-Home.

AgentStore：異種エージェントのスケーラブルな統合を専門とする汎用コンピュータアシスタント

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

要旨

Support