Memp: エージェントの手続き的メモリの探求

要旨

大規模言語モデル（LLM）ベースのエージェントは多様なタスクに優れているが、手動で設計されたり静的なパラメータに埋め込まれたりする脆弱な手続き記憶に悩まされている。本研究では、エージェントに学習可能で更新可能かつ生涯にわたる手続き記憶を付与する戦略を探る。我々は、過去のエージェント軌跡を細粒度のステップバイステップの指示と高レベルのスクリプトのような抽象化の両方に蒸留するMempを提案し、手続き記憶の構築、検索、更新のための異なる戦略の影響を探る。その内容を継続的に更新、修正、廃棄する動的なレジームと組み合わせることで、このリポジトリは新しい経験と共に進化する。TravelPlannerとALFWorldでの実証評価では、記憶リポジトリが洗練されるにつれて、エージェントは類似タスクにおいて着実に高い成功率と効率性を達成することが示された。さらに、より強力なモデルから構築された手続き記憶はその価値を保持し、それをより弱いモデルに移行しても大幅な性能向上が得られる。

English

Large Language Models (LLMs) based agents excel at diverse tasks, yet they suffer from brittle procedural memory that is manually engineered or entangled in static parameters. In this work, we investigate strategies to endow agents with a learnable, updatable, and lifelong procedural memory. We propose Memp that distills past agent trajectories into both fine-grained, step-by-step instructions and higher-level, script-like abstractions, and explore the impact of different strategies for Build, Retrieval, and Update of procedural memory. Coupled with a dynamic regimen that continuously updates, corrects, and deprecates its contents, this repository evolves in lockstep with new experience. Empirical evaluation on TravelPlanner and ALFWorld shows that as the memory repository is refined, agents achieve steadily higher success rates and greater efficiency on analogous tasks. Moreover, procedural memory built from a stronger model retains its value: migrating the procedural memory to a weaker model yields substantial performance gains.

Memp: エージェントの手続き的メモリの探求

Memp: Exploring Agent Procedural Memory

要旨

Support