EvolveMem: Zelf-evoluerende Geheugenarchitectuur via AutoResearch voor LLM-agenten

Samenvatting

Langetermijngeheugen is essentieel voor LLM-agenten die over meerdere sessies opereren, maar bestaande geheugensystemen behandelen de retrievalinfrastructuur als vast: opgeslagen inhoud evolueert, terwijl scoreringsfuncties, fusiestrategieën en antwoordgeneratiebeleid bevroren blijven bij implementatie. Wij stellen dat een werkelijk adaptief geheugen co-evolutie op twee niveaus vereist: de opgeslagen kennis en het retrievalmechanisme dat deze bevraagt. We presenteren EvolveMem, een zelf-evoluerende geheugenarchitectuur die de volledige retrievalconfiguratie blootlegt als een gestructureerde actieruimte, geoptimaliseerd door een door een LLM aangedreven diagnosemodule. In elke evolutieronde leest de module per-vraag faallogboeken, identificeert grondoorzaken en stelt gerichte configuratieaanpassingen voor; een bewaakte meta-analyzer past deze toe met automatische terugdraaiing bij regressie en exploratie bij stagnatie als waarborgen. Deze gesloten-lus zelf-evolutie realiseert een AutoResearch-proces: het systeem voert autonoom iteratieve onderzoekscycli uit op zijn eigen architectuur, ter vervanging van handmatige configuratietuning. Vanaf een minimale baseline convergeert het proces autonoom en ontdekt effectieve retrievalstrategieën, inclusief volledig nieuwe configuratiedimensies die niet aanwezig waren in de oorspronkelijke actieruimte. Op LoCoMo overtreft EvolveMem de sterkste baseline met 25,7% relatief en behaalt een relatieve verbetering van 78,0% ten opzichte van de minimale baseline. Op MemBench overtreft EvolveMem de sterkste baseline met 18,9% relatief. Geëvolueerde configuraties worden positief overgedragen tussen benchmarks, zonder catastrofale overdracht, wat aangeeft dat het zelf-evolutieproces universele retrievalprincipes vastlegt in plaats van benchmarkspecifieke heuristieken. Code is beschikbaar op https://github.com/aiming-lab/SimpleMem.

English

Long-term memory is essential for LLM agents that operate across multiple sessions, yet existing memory systems treat retrieval infrastructure as fixed: stored content evolves while scoring functions, fusion strategies, and answer-generation policies remain frozen at deployment. We argue that truly adaptive memory requires co-evolution at two levels: the stored knowledge and the retrieval mechanism that queries it. We present EvolveMem, a self-evolving memory architecture that exposes its full retrieval configuration as a structured action space optimized by an LLM-powered diagnosis module. In each evolution round, the module reads per-question failure logs, identifies root causes, and proposes targeted configuration adjustments; a guarded meta-analyzer applies them with automatic revert-on-regression and explore-on-stagnation safeguards. This closed-loop self-evolution realizes an AutoResearch process: the system autonomously conducts iterative research cycles on its own architecture, replacing manual configuration tuning. Starting from a minimal baseline, the process converges autonomously, discovering effective retrieval strategies including entirely new configuration dimensions not present in the original action space. On LoCoMo, EvolveMem outperforms the strongest baseline by 25.7% relative and achieves a 78.0% relative improvement over the minimal baseline. On MemBench, EvolveMem exceeds the strongest baseline by 18.9% relative. Evolved configurations transfer across benchmarks with positive rather than catastrophic transfer, indicating that the self-evolution process captures universal retrieval principles rather than benchmark-specific heuristics. Code is available at https://github.com/aiming-lab/SimpleMem.