Memp: 에이전트 절차적 메모리 탐구

초록

대형 언어 모델(LLMs) 기반 에이전트는 다양한 작업에서 뛰어난 성능을 보이지만, 수동으로 설계되거나 정적 매개변수에 얽힌 취약한 절차적 메모리로 인해 한계를 겪는다. 본 연구에서는 에이전트에게 학습 가능하고 업데이트 가능하며 평생 지속되는 절차적 메모리를 부여하기 위한 전략을 탐구한다. 우리는 과거 에이전트 궤적을 세분화된 단계별 지침과 더 높은 수준의 스크립트 형태의 추상화로 정제하는 Memp를 제안하고, 절차적 메모리의 구축, 검색, 업데이트를 위한 다양한 전략의 영향을 탐구한다. 이 저장소는 새로운 경험과 동시에 진화하며, 내용을 지속적으로 업데이트, 수정, 폐기하는 동적 체제와 결합된다. TravelPlanner와 ALFWorld에서의 실험적 평가는 메모리 저장소가 개선됨에 따라 에이전트가 유사한 작업에서 꾸준히 더 높은 성공률과 더 큰 효율성을 달성함을 보여준다. 또한, 더 강력한 모델로부터 구축된 절차적 메모리는 그 가치를 유지한다: 절차적 메모리를 더 약한 모델로 이전하면 상당한 성능 향상을 얻을 수 있다.

English

Large Language Models (LLMs) based agents excel at diverse tasks, yet they suffer from brittle procedural memory that is manually engineered or entangled in static parameters. In this work, we investigate strategies to endow agents with a learnable, updatable, and lifelong procedural memory. We propose Memp that distills past agent trajectories into both fine-grained, step-by-step instructions and higher-level, script-like abstractions, and explore the impact of different strategies for Build, Retrieval, and Update of procedural memory. Coupled with a dynamic regimen that continuously updates, corrects, and deprecates its contents, this repository evolves in lockstep with new experience. Empirical evaluation on TravelPlanner and ALFWorld shows that as the memory repository is refined, agents achieve steadily higher success rates and greater efficiency on analogous tasks. Moreover, procedural memory built from a stronger model retains its value: migrating the procedural memory to a weaker model yields substantial performance gains.

Memp: 에이전트 절차적 메모리 탐구

Memp: Exploring Agent Procedural Memory

초록

Support