AgentSPEX: Een Agent Specificatie en Uitvoeringstaal

Samenvatting

Taalmodel-agentsystemen maken doorgaans gebruik van reactieve prompting, waarbij een enkele instructie het model door een open opeenvolging van redeneer- en toolgebruikstappen leidt. Hierbij blijven controleflow en tussenliggende staat impliciet, wat het gedrag van de agent potentieel moeilijk te controleren maakt. Orchestratie-frameworks zoals LangGraph, DSPy en CrewAI leggen meer structuur op via expliciete workflowdefinities, maar koppelen de workflowlogica sterk aan Python, waardoor agents moeilijk te onderhouden en aan te passen zijn. In dit artikel introduceren we AgentSPEX, een Agent Specificatie- en Uitvoeringstaal voor het specificeren van LLM-agent-workflows met expliciete controleflow en modulaire structuur, samen met een aanpasbare agent-harnassing. AgentSPEX ondersteunt getypeerde stappen, vertakkingen en lussen, parallelle uitvoering, herbruikbare submodules en expliciet staatbeheer. Deze workflows worden uitgevoerd binnen een agent-harnassing die tooltoegang, een afgeschermde virtuele omgeving en ondersteuning voor checkpointing, verificatie en logging biedt. Verder bieden we een visuele editor met gesynchroniseerde grafische en workflowweergaven voor ontwerp en inspectie. We includeren kant-en-klare agents voor diepgaand onderzoek en wetenschappelijk onderzoek, en we evalueren AgentSPEX op 7 benchmarks. Tot slot tonen we via een gebruikersstudie aan dat AgentSPEX een meer interpreteerbaar en toegankelijk workflow-ontwerpparadigma biedt dan een populair bestaand agentframework.

English

Language-model agent systems commonly rely on reactive prompting, in which a single instruction guides the model through an open-ended sequence of reasoning and tool-use steps, leaving control flow and intermediate state implicit and making agent behavior potentially difficult to control. Orchestration frameworks such as LangGraph, DSPy, and CrewAI impose greater structure through explicit workflow definitions, but tightly couple workflow logic with Python, making agents difficult to maintain and modify. In this paper, we introduce AgentSPEX, an Agent SPecification and EXecution Language for specifying LLM-agent workflows with explicit control flow and modular structure, along with a customizable agent harness. AgentSPEX supports typed steps, branching and loops, parallel execution, reusable submodules, and explicit state management, and these workflows execute within an agent harness that provides tool access, a sandboxed virtual environment, and support for checkpointing, verification, and logging. Furthermore, we provide a visual editor with synchronized graph and workflow views for authoring and inspection. We include ready-to-use agents for deep research and scientific research, and we evaluate AgentSPEX on 7 benchmarks. Finally, we show through a user study that AgentSPEX provides a more interpretable and accessible workflow-authoring paradigm than a popular existing agent framework.

AgentSPEX: Een Agent Specificatie en Uitvoeringstaal

AgentSPEX: An Agent SPecification and EXecution Language

Samenvatting

Support