기술 텍스트에서 기술 구조로: 에이전트 기술의 스케줄링-구조-논리 표현

초록

LLM 에이전트는 지침, 제어 흐름, 제약 조건, 도구 호출을 결합한 능력 패키지인 재사용 가능한 스킬에 점점 더 의존하고 있습니다. 그러나 현재 대부분의 에이전트 시스템에서 스킬은 여전히 SKILL.md 스타일 문서와 기계 가용 정보가 대부분 자연어 설명에 내재된 구조화된 기록을 포함하여 텍스트 중심 아티팩트로 표현됩니다. 이는 스킬 중심 에이전트 시스템에 과제를 제기합니다: 스킬 컬렉션 관리와 에이전트 지원을 위한 스킬 활용 모두 단일 텍스트 표면에 얽혀 있는 호출 인터페이스, 실행 구조, 구체적 부수 효과에 대한 추론이 필요하기 때문입니다. 따라서 스킬 지식의 명시적 표현은 이러한 아티팩트를 기계가 획득하고 활용하기 쉽게 만드는 데 도움이 될 수 있습니다. Schank와 Abelson의 언어적 지식 표현에 관한 고전적 연구인 메모리 조직 패킷, 스크립트 이론, 개념적 의존성을 바탕으로, 우리는 스킬 수준의 스케줄링 신호, 장면 수준의 실행 구조, 논리 수준의 행동 및 자원 사용 증거를 분리하는 에이전트 스킬 아티팩트 최초의 구조화된 표현인 Scheduling-Structural-Logical(SSL) 표현을 소개합니다. 우리는 LLM 기반 정규화기를 통해 SSL을 구현하고 스킬 발견과 위험 평가 두 가지 작업에서 스킬 코퍼스에 대해 평가하여 텍스트 전용 베이스라인을 크게 능가했습니다: 스킬 발견에서 SSL은 MRR을 0.573에서 0.707로 향상시켰으며, 위험 평가에서는 macro F1을 0.744에서 0.787로 향상시켰습니다. 이러한 결과는 소스에 기반한 명시적 구조가 에이전트 스킬의 검색과 검토를 용이하게 함을 보여줍니다. 또한 SSL은 완성된 표준이나 종단간 스킬 관리/사용 메커니즘이라기보다, 에이전트 시스템을 위한 더 투명하고 재사용 가능하며 운영적으로 실행 가능한 스킬 표현을 향한 실용적 단계로 이해하는 것이 최선임을 시사합니다.

English

LLM agents increasingly rely on reusable skills, capability packages that combine instructions, control flow, constraints, and tool calls. In most current agent systems, however, skills are still represented by text-heavy artifacts, including SKILL.md-style documents and structured records whose machine-usable evidence remains embedded largely in natural-language descriptions. This poses a challenge for skill-centered agent systems: managing skill collections and using skills to support agent both require reasoning over invocation interfaces, execution structure, and concrete side effects that are often entangled in a single textual surface. An explicit representation of skill knowledge may therefore help make these artifacts easier for machines to acquire and leverage. Drawing on Memory Organization Packets, Script Theory, and Conceptual Dependency from Schank and Abelson's classical work on linguistic knowledge representation, we introduce what is, to our knowledge, the first structured representation for agent skill artifacts that disentangles skill-level scheduling signals, scene-level execution structure, and logic-level action and resource-use evidence: the Scheduling-Structural-Logical (SSL) representation. We instantiate SSL with an LLM-based normalizer and evaluate it on a corpus of skills in two tasks, Skill Discovery and Risk Assessment, and superiorly outperform the text-only baselines: in Skill Discovery, SSL improves MRR from 0.573 to 0.707; in Risk Assessment, it improves macro F1 from 0.744 to 0.787. These findings reveal that explicit, source-grounded structure makes agent skills easier to search and review. They also suggest that SSL is best understood as a practical step toward more inspectable, reusable, and operationally actionable skill representations for agent systems, rather than as a finished standard or an end-to-end mechanism for managing and using skills.

기술 텍스트에서 기술 구조로: 에이전트 기술의 스케줄링-구조-논리 표현

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

초록

Support