EnvX：エージェントAIによるあらゆるもののエージェント化

要旨

オープンソースリポジトリの広範な利用可能性により、再利用可能なソフトウェアコンポーネントの膨大なコレクションが生まれましたが、その利用は依然として手動で、エラーが発生しやすく、断片的なままです。開発者はドキュメントを参照し、APIを理解し、統合コードを記述する必要があり、効率的なソフトウェア再利用への大きな障壁となっています。これを解決するため、我々はEnvXを提案します。EnvXは、Agentic AIを活用してGitHubリポジトリをエージェント化し、自然言語でのインタラクションやエージェント間のコラボレーションが可能な、知的で自律的なエージェントに変革するフレームワークです。既存のアプローチがリポジトリを静的なコードリソースとして扱うのに対し、EnvXはそれらをアクティブなエージェントとして再構築します。これは3つのフェーズからなるプロセスで実現されます：(1) TODOガイドによる環境初期化、これにより必要な依存関係、データ、検証用データセットがセットアップされます；(2) 人間に沿ったエージェント的オートメーション、これによりリポジトリ固有のエージェントが現実世界のタスクを自律的に実行できるようになります；(3) エージェント間（A2A）プロトコル、これにより複数のエージェントが協力できるようになります。大規模言語モデルの能力と構造化されたツール統合を組み合わせることで、EnvXはコード生成だけでなく、リポジトリ機能の理解、初期化、運用化の全プロセスを自動化します。我々はEnvXをGitTaskBenchベンチマークで評価し、画像処理、音声認識、ドキュメント分析、ビデオ操作などの分野にわたる18のリポジトリを使用しました。その結果、EnvXは74.07%の実行完了率と51.85%のタスク合格率を達成し、既存のフレームワークを上回りました。ケーススタディでは、A2Aプロトコルを介したマルチリポジトリコラボレーションを可能にするEnvXの能力がさらに実証されています。この研究は、リポジトリを受動的なコードリソースとして扱うことから、知的でインタラクティブなエージェントとして扱うことへの転換を示し、オープンソースエコシステム内でのアクセシビリティとコラボレーションを促進します。

English

The widespread availability of open-source repositories has led to a vast collection of reusable software components, yet their utilization remains manual, error-prone, and disconnected. Developers must navigate documentation, understand APIs, and write integration code, creating significant barriers to efficient software reuse. To address this, we present EnvX, a framework that leverages Agentic AI to agentize GitHub repositories, transforming them into intelligent, autonomous agents capable of natural language interaction and inter-agent collaboration. Unlike existing approaches that treat repositories as static code resources, EnvX reimagines them as active agents through a three-phase process: (1) TODO-guided environment initialization, which sets up the necessary dependencies, data, and validation datasets; (2) human-aligned agentic automation, allowing repository-specific agents to autonomously perform real-world tasks; and (3) Agent-to-Agent (A2A) protocol, enabling multiple agents to collaborate. By combining large language model capabilities with structured tool integration, EnvX automates not just code generation, but the entire process of understanding, initializing, and operationalizing repository functionality. We evaluate EnvX on the GitTaskBench benchmark, using 18 repositories across domains such as image processing, speech recognition, document analysis, and video manipulation. Our results show that EnvX achieves a 74.07% execution completion rate and 51.85% task pass rate, outperforming existing frameworks. Case studies further demonstrate EnvX's ability to enable multi-repository collaboration via the A2A protocol. This work marks a shift from treating repositories as passive code resources to intelligent, interactive agents, fostering greater accessibility and collaboration within the open-source ecosystem.

EnvX：エージェントAIによるあらゆるもののエージェント化

EnvX: Agentize Everything with Agentic AI

要旨

Support