Von Fertigkeitstext zu Fertigkeitsstruktur: Die terminologisch-strukturell-logische Repräsentation für Agentenfertigkeiten

Zusammenfassung

LLM-Agenten setzen zunehmend auf wiederverwendbare Fähigkeiten (Skills) – Funktionspakete, die Anweisungen, Kontrollfluss, Einschränkungen und Tool-Aufrufe kombinieren. In den meisten aktuellen Agentensystemen werden Fähigkeiten jedoch noch durch textlastige Artefakte repräsentiert, darunter SKILL.md-artige Dokumente und strukturierte Datensätze, deren maschinell verwertbare Informationen größtenteils in natürlicher Sprache eingebettet bleiben. Dies stellt eine Herausforderung für skillzentrierte Agentensysteme dar: Sowohl die Verwaltung von Skill-Sammlungen als auch deren Einsatz zur Unterstützung von Agenten erfordern ein Reasoning über Aufrufschnittstellen, Ausführungsstruktur und konkrete Nebeneffekte, die oft in einer einzigen Textoberfläche verflochten sind. Eine explizite Repräsentation von Skill-Wissen könnte daher dazu beitragen, diese Artefakte für Maschinen leichter erfassbar und nutzbar zu machen. Inspiriert von Memory Organization Packets, Script Theory und Conceptual Dependency aus dem klassischen Werk von Schank und Abelson zur linguistischen Wissensrepräsentation führen wir die nach unserem Wissen erste strukturierte Repräsentation für Agenten-Skill-Artefakte ein, die skillbezogene Planungssignale, szenenbasierte Ausführungsstruktur sowie logische Handlungs- und Ressourcennutzungsinformationen entflechtet: die Scheduling-Structural-Logical (SSL)-Repräsentation. Wir implementieren SSL mit einem LLM-basierten Normalisierer und evaluieren ihn anhand eines Skill-Korpus in zwei Aufgaben: Skill-Entdeckung und Risikobewertung. Dabei übertreffen wir textbasierte Baseline-Methoden deutlich: Bei der Skill-Entdeckung verbessert SSL den MRR-Wert von 0,573 auf 0,707; bei der Risikobewertung steigert es den makro-F1-Score von 0,744 auf 0,787. Diese Ergebnisse zeigen, dass eine explizite, quellfundierte Strukturierung Agenten-Skills besser durchsuch- und überprüfbar macht. Sie legen auch nahe, dass SSL am besten als praktischer Schritt hin zu besser inspectierbaren, wiederverwendbaren und operationalisierbaren Skill-Repräsentationen für Agentensysteme zu verstehen ist – und nicht als fertiger Standard oder als End-to-End-Mechanismus zur Verwaltung und Nutzung von Fähigkeiten.

English

LLM agents increasingly rely on reusable skills, capability packages that combine instructions, control flow, constraints, and tool calls. In most current agent systems, however, skills are still represented by text-heavy artifacts, including SKILL.md-style documents and structured records whose machine-usable evidence remains embedded largely in natural-language descriptions. This poses a challenge for skill-centered agent systems: managing skill collections and using skills to support agent both require reasoning over invocation interfaces, execution structure, and concrete side effects that are often entangled in a single textual surface. An explicit representation of skill knowledge may therefore help make these artifacts easier for machines to acquire and leverage. Drawing on Memory Organization Packets, Script Theory, and Conceptual Dependency from Schank and Abelson's classical work on linguistic knowledge representation, we introduce what is, to our knowledge, the first structured representation for agent skill artifacts that disentangles skill-level scheduling signals, scene-level execution structure, and logic-level action and resource-use evidence: the Scheduling-Structural-Logical (SSL) representation. We instantiate SSL with an LLM-based normalizer and evaluate it on a corpus of skills in two tasks, Skill Discovery and Risk Assessment, and superiorly outperform the text-only baselines: in Skill Discovery, SSL improves MRR from 0.573 to 0.707; in Risk Assessment, it improves macro F1 from 0.744 to 0.787. These findings reveal that explicit, source-grounded structure makes agent skills easier to search and review. They also suggest that SSL is best understood as a practical step toward more inspectable, reusable, and operationally actionable skill representations for agent systems, rather than as a finished standard or an end-to-end mechanism for managing and using skills.

Von Fertigkeitstext zu Fertigkeitsstruktur: Die terminologisch-strukturell-logische Repräsentation für Agentenfertigkeiten

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

Zusammenfassung

Support