Memento-Skills: エージェントにエージェントを設計させる

要旨

我々はMemento-Skillsを紹介する。これは汎用性が高く継続学習可能なLLMエージェントシステムであり、エージェント設計エージェントとして機能する：経験を通じてタスク特化型エージェントを自律的に構築・適応・改善する。本システムは状態保持プロンプトを用いたメモリベース強化学習フレームワーク上に構築され、再利用可能なスキル（構造化マークダウンファイルとして保存）が永続的かつ進化するメモリとして機能する。これらのスキルは行動とコンテキストの両方を符号化し、エージェントが対話を超えて知識を継承することを可能にする。基本的な初歩的スキル（Web検索やターミナル操作など）から開始し、エージェントはMemento~2~wang2025memento2で導入されたRead-Write反射的学習メカニズムを通じて継続的に改善する。読取フェーズでは、行動学習可能なスキルルーターが現在の状態保持プロンプトに条件付けられた最も関連性の高いスキルを選択し、書込フェーズでは、エージェントは新たな経験に基づいてスキルライブラリを更新・拡張する。この閉ループ設計により、全ての適応が外部化されたスキルとプロンプトの進化を通じて実現されるため、LLMパラメータを更新することなく継続学習が可能となる。人間が設計したエージェントに依存する従来手法とは異なり、Memento-Skillsは汎用エージェントが新規タスク向けにエンドツーエンドでエージェントを設計することを可能にする。反復的なスキル生成と洗練を通じて、システムは自身の能力を段階的に向上させる。General AI AssistantsベンチマークとHumanity's Last Examでの実験では持続的な性能向上を示し、それぞれ総合精度で26.2%、116.2%の相対的改善を達成した。コードはhttps://github.com/Memento-Teams/Memento-Skillsで公開されている。

English

We introduce Memento-Skills, a generalist, continually-learnable LLM agent system that functions as an agent-designing agent: it autonomously constructs, adapts, and improves task-specific agents through experience. The system is built on a memory-based reinforcement learning framework with stateful prompts, where reusable skills (stored as structured markdown files) serve as persistent, evolving memory. These skills encode both behaviour and context, enabling the agent to carry forward knowledge across interactions. Starting from simple elementary skills (like Web search and terminal operations), the agent continually improves via the Read--Write Reflective Learning mechanism introduced in Memento~2~wang2025memento2. In the read phase, a behaviour-trainable skill router selects the most relevant skill conditioned on the current stateful prompt; in the write phase, the agent updates and expands its skill library based on new experience. This closed-loop design enables continual learning without updating LLM parameters, as all adaptation is realised through the evolution of externalised skills and prompts. Unlike prior approaches that rely on human-designed agents, Memento-Skills enables a generalist agent to design agents end-to-end for new tasks. Through iterative skill generation and refinement, the system progressively improves its own capabilities. Experiments on the General AI Assistants benchmark and Humanity's Last Exam demonstrate sustained gains, achieving 26.2\% and 116.2\% relative improvements in overall accuracy, respectively. Code is available at https://github.com/Memento-Teams/Memento-Skills.

Memento-Skills: エージェントにエージェントを設計させる

Memento-Skills: Let Agents Design Agents

要旨

Support