Op weg naar schaalbare synthese van terminale taken via vaardigheidsgrafen

Samenvatting

Terminale agents hebben een sterk potentieel getoond voor autonome commandoregeluitvoering, maar hun training blijft beperkt door de schaarste aan hoogwaardige en diverse uitvoeringstrajecten. Bestaande benaderingen verlichten dit knelpunt door grootschalige terminaltaakinstanties te synthetiseren voor trajectmonsters. Zij richten zich echter voornamelijk op het opschalen van het aantal taken, terwijl ze slechts beperkte controle bieden over de diversiteit van uitvoeringstrajecten die agents daadwerkelijk ervaren tijdens de training. In dit artikel presenteren we SkillSynth, een geautomatiseerd raamwerk voor terminaltaaksynthese, gebaseerd op een scenario-gemedieerde vaardigheidsgraaf. SkillSynth construeert eerst een grootschalige vaardigheidsgraaf, waarbij scenario's fungeren als intermediare transitieknopen die diverse commandoregelvaardigheden verbinden. Vervolgens bemonstert het paden vanuit deze graaf als abstracties van real-world werkstromen, en gebruikt het een multi-agent systeem om deze te instantiëren naar uitvoerbare taakinstanties. Door taaksynthese te grondvesten in graf-bemonsterde werkstroompaden, beheerst SkillSynth expliciet de diversiteit van minimale uitvoeringstrajecten die nodig zijn om de gesynthetiseerde taken op te lossen. Experimenten op Terminal-Bench demonstreren de effectiviteit van SkillSynth. Bovendien zijn taakinstanties gesynthetiseerd door SkillSynth geadopteerd om Hy3 Preview te trainen, wat bijdraagt aan diens verbeterde agent-mogelijkheden in terminal-gebaseerde omgevingen.

English

Terminal agents have demonstrated strong potential for autonomous command-line execution, yet their training remains constrained by the scarcity of high-quality and diverse execution trajectories. Existing approaches mitigate this bottleneck by synthesizing large-scale terminal task instances for trajectory sampling. However, they primarily focus on scaling the number of tasks while providing limited control over the diversity of execution trajectories that agents actually experience during training. In this paper, we present SkillSynth, an automated framework for terminal task synthesis built on a scenario-mediated skill graph. SkillSynth first constructs a large-scale skill graph, where scenarios serve as intermediate transition nodes that connect diverse command-line skills. It then samples paths from this graph as abstractions of real-world workflows, and uses a multi-agent harness to instantiate them into executable task instances. By grounding task synthesis in graph-sampled workflow paths, SkillSynth explicitly controls the diversity of minimal execution trajectories required to solve the synthesized tasks. Experiments on Terminal-Bench demonstrate the effectiveness of SkillSynth. Moreover, task instances synthesized by SkillSynth have been adopted to train Hy3 Preview, contributing to its enhanced agentic capabilities in terminal-based settings.

Op weg naar schaalbare synthese van terminale taken via vaardigheidsgrafen

Toward Scalable Terminal Task Synthesis via Skill Graphs

Samenvatting

Support