MemEvolve: 에이전트 메모리 시스템의 메타 진화

초록

자기 진화 메모리 시스템은 대규모 언어 모델(LLM) 기반 에이전트의 진화 패러다임을 전례 없이 재편하고 있다. 기존 연구는 주로 수동으로 설계된 메모리 아키텍처에 의존하여 행동 궤적을 저장하고, 경험을 정제하며, 재사용 가능한 도구를 합성함으로써 에이전트가 환경 상호작용 속에서 즉시 진화할 수 있도록 했다. 그러나 이러한 패러다임은 메모리 시스템 자체의 정적 특성에 의해 근본적으로 제한된다. 즉, 메모리가 에이전트 수준의 진화를 용이하게 하는 반면, 기반이 되는 메모리 아키텍처는 다양한 작업 맥락에 맞게 메타 적응될 수 없다. 이러한 격차를 해결하기 위해 우리는 에이전트의 경험적 지식과 메모리 아키텍처를 공동으로 진화시켜, 에이전트 시스템이 경험을 축적할 뿐만 아니라 그로부터 학습하는 방식을 점진적으로 개선할 수 있도록 하는 메타 진화 프레임워크인 MemEvolve를 제안한다. MemEvolve를 선행 연구에 기반을 두고 미래 자기 진화 시스템의 개방성을 촉진하기 위해, 우리는 12개의 대표적인 메모리 시스템을 모듈식 설계 공간(인코드, 저장, 검색, 관리)으로 정제한 통합 자기 진화 메모리 코드베이스인 EvolveLab을 소개한다. 이는 표준화된 구현 기반과 공정한 실험 장을 동시에 제공한다. 4가지 도전적인 에이전트 벤치마크에 대한 포괄적인 평가를 통해 MemEvolve가 (I) SmolAgent 및 Flash-Searcher와 같은 프레임워크를 최대 17.06% 향상시키는 상당한 성능 이득을 달성하고, (II) 다양한 벤치마크와 백본 모델을 효과적으로 전이하는 메모리 아키텍처를 설계하는 강력한 교차 작업 및 교차 LLM 일반화 능력을 보여줌을 입증한다.

English

Self-evolving memory systems are unprecedentedly reshaping the evolutionary paradigm of large language model (LLM)-based agents. Prior work has predominantly relied on manually engineered memory architectures to store trajectories, distill experience, and synthesize reusable tools, enabling agents to evolve on the fly within environment interactions. However, this paradigm is fundamentally constrained by the staticity of the memory system itself: while memory facilitates agent-level evolving, the underlying memory architecture cannot be meta-adapted to diverse task contexts. To address this gap, we propose MemEvolve, a meta-evolutionary framework that jointly evolves agents' experiential knowledge and their memory architecture, allowing agent systems not only to accumulate experience but also to progressively refine how they learn from it. To ground MemEvolve in prior research and foster openness in future self-evolving systems, we introduce EvolveLab, a unified self-evolving memory codebase that distills twelve representative memory systems into a modular design space (encode, store, retrieve, manage), providing both a standardized implementation substrate and a fair experimental arena. Extensive evaluations on four challenging agentic benchmarks demonstrate that MemEvolve achieves (I) substantial performance gains, improving frameworks such as SmolAgent and Flash-Searcher by up to 17.06%; and (II) strong cross-task and cross-LLM generalization, designing memory architectures that transfer effectively across diverse benchmarks and backbone models.

MemEvolve: 에이전트 메모리 시스템의 메타 진화

MemEvolve: Meta-Evolution of Agent Memory Systems

초록

Support