AgentSPEX：エージェント仕様記述および実行言語

要旨

言語モデルエージェントシステムでは、一般的に反応的プロンプティングが用いられています。これは単一の指示によってモデルをオープンエンドな推論とツール使用の連鎖へ導く手法ですが、制御フローや中間状態が暗黙的となるため、エージェントの挙動を制御することが難しくなる可能性があります。LangGraph、DSPy、CrewAIなどのオーケストレーションフレームワークは明示的なワークフロー定義によって構造化を図っていますが、ワークフロー論理がPythonと密結合するため、エージェントの保守や修正が困難になります。本論文では、明示的な制御フローとモジュラー構造を持つLLMエージェントワークフローを定義するためのAgentSPEX（Agent Specification and Execution Language）を提案します。これに加えて、カスタマイズ可能なエージェントハーネスも提供します。AgentSPEXは型付きステップ、分岐とループ、並列実行、再利用可能なサブモジュール、明示的状態管理をサポートし、これらのワークフローは、ツールアクセス、サンドボックス化された仮想環境、チェックポイント機能、検証、ロギングを備えたエージェントハーネス内で実行されます。さらに、作成と検査のための同期化されたグラフビューとワークフロービューを備えたビジュアルエディタを提供します。深層研究および科学研究用の即時利用可能なエージェントを含め、7つのベンチマークでAgentSPEXを評価します。最後に、ユーザー調査を通じて、AgentSPEXが既存の一般的なエージェントフレームワークよりも解釈性とアクセシビリティに優れたワークフロー作成パラダイムを提供することを示します。

English

Language-model agent systems commonly rely on reactive prompting, in which a single instruction guides the model through an open-ended sequence of reasoning and tool-use steps, leaving control flow and intermediate state implicit and making agent behavior potentially difficult to control. Orchestration frameworks such as LangGraph, DSPy, and CrewAI impose greater structure through explicit workflow definitions, but tightly couple workflow logic with Python, making agents difficult to maintain and modify. In this paper, we introduce AgentSPEX, an Agent SPecification and EXecution Language for specifying LLM-agent workflows with explicit control flow and modular structure, along with a customizable agent harness. AgentSPEX supports typed steps, branching and loops, parallel execution, reusable submodules, and explicit state management, and these workflows execute within an agent harness that provides tool access, a sandboxed virtual environment, and support for checkpointing, verification, and logging. Furthermore, we provide a visual editor with synchronized graph and workflow views for authoring and inspection. We include ready-to-use agents for deep research and scientific research, and we evaluate AgentSPEX on 7 benchmarks. Finally, we show through a user study that AgentSPEX provides a more interpretable and accessible workflow-authoring paradigm than a popular existing agent framework.

AgentSPEX：エージェント仕様記述および実行言語

AgentSPEX: An Agent SPecification and EXecution Language

要旨

Support