MemEvolve: エージェントメモリシステムのメタ進化

要旨

自己進化型メモリシステムは、大規模言語モデル（LLM）ベースのエージェントの進化的パラダイムを前例なく再構築しつつある。従来の研究では、軌跡の保存、経験の蒸留、再利用可能なツールの合成を目的として手動設計されたメモリ構造に依存することが主流であり、これによりエージェントは環境相互作用内でオンザフライに進化することが可能であった。しかし、このパラダイムはメモリシステム自体の静的な性質によって根本的に制約されている。すなわち、メモリがエージェントレベルの進化を促進する一方で、基盤となるメモリ構造は多様なタスクコンテキストにメタ適応できないのである。この課題を解決するため、我々はMemEvolveを提案する。これはエージェントの経験的知識とメモリ構造を共進化させるメタ進化フレームワークであり、エージェントシステムが経験を蓄積するだけでなく、その学習方法を漸進的に洗練させることを可能にする。MemEvolveを先行研究に位置づけ、将来の自己進化システムの開放性を促進するため、EvolveLabを導入する。これは12の代表的なメモリシステムをモジュラー設計空間（符号化、保存、検索、管理）に蒸留した統一自己進化メモリコードベースであり、標準化された実装基盤と公平な実験環境を提供する。4つの挑戦的なエージェントベンチマークにおける大規模評価により、MemEvolveが（I）SmolAgentやFlash-Searcherなどのフレームワークを最大17.06%改善する大幅な性能向上を達成し、（II）強いクロスタスク・クロスLLM一般化能力を示し、多様なベンチマークと基盤モデルに効果的に転移可能なメモリ構造を設計することを実証した。

English

Self-evolving memory systems are unprecedentedly reshaping the evolutionary paradigm of large language model (LLM)-based agents. Prior work has predominantly relied on manually engineered memory architectures to store trajectories, distill experience, and synthesize reusable tools, enabling agents to evolve on the fly within environment interactions. However, this paradigm is fundamentally constrained by the staticity of the memory system itself: while memory facilitates agent-level evolving, the underlying memory architecture cannot be meta-adapted to diverse task contexts. To address this gap, we propose MemEvolve, a meta-evolutionary framework that jointly evolves agents' experiential knowledge and their memory architecture, allowing agent systems not only to accumulate experience but also to progressively refine how they learn from it. To ground MemEvolve in prior research and foster openness in future self-evolving systems, we introduce EvolveLab, a unified self-evolving memory codebase that distills twelve representative memory systems into a modular design space (encode, store, retrieve, manage), providing both a standardized implementation substrate and a fair experimental arena. Extensive evaluations on four challenging agentic benchmarks demonstrate that MemEvolve achieves (I) substantial performance gains, improving frameworks such as SmolAgent and Flash-Searcher by up to 17.06%; and (II) strong cross-task and cross-LLM generalization, designing memory architectures that transfer effectively across diverse benchmarks and backbone models.

MemEvolve: エージェントメモリシステムのメタ進化

MemEvolve: Meta-Evolution of Agent Memory Systems

要旨

Support