RE-TRAC: Recursieve TRAjectoriecompressie voor Diepgaande Zoekagenten

Samenvatting

LLM-gebaseerde diepgaande onderzoeksagenten zijn grotendeels gebouwd op het ReAct-framework. Deze lineaire ontwerpaanpak maakt het moeilijk om eerdere staten opnieuw te bezoeken, af te wijken naar alternatieve zoekrichtingen of een globaal overzicht te behouden bij lange contexten, wat vaak leidt tot lokale optima, redundante verkenning en inefficiënt zoekgedrag. Wij stellen Re-TRAC voor, een agentframework dat kruis-trajectverkenning uitvoert door na elk traject een gestructureerde staatrepresentatie te genereren om evidence, onzekerheden, fouten en toekomstplannen samen te vatten, en vervolgtrajecten te conditioneren op deze staatrepresentatie. Hierdoor worden iteratieve reflectie en globaal geïnformeerde planning mogelijk, waarbij onderzoek wordt herkaderd als een progressief proces. Empirische resultaten tonen aan dat Re-TRAC op BrowseConsistent 15-20% beter presteert dan ReAct bij gebruik van geavanceerde LLM's. Voor kleinere modellen introduceren we Re-TRAC-bewuste supervised fine-tuning, wat state-of-the-art prestaties oplevert op vergelijkbare schalen. Opmerkelijk is dat Re-TRAC een monotone afname vertoont in toolaanroepen en tokenverbruik over meerdere rondes, wat wijst op progressief gerichte verkenning gedreven door kruis-trajectreflectie in plaats van redundante zoekacties.

English

LLM-based deep research agents are largely built on the ReAct framework. This linear design makes it difficult to revisit earlier states, branch into alternative search directions, or maintain global awareness under long contexts, often leading to local optima, redundant exploration, and inefficient search. We propose Re-TRAC, an agentic framework that performs cross-trajectory exploration by generating a structured state representation after each trajectory to summarize evidence, uncertainties, failures, and future plans, and conditioning subsequent trajectories on this state representation. This enables iterative reflection and globally informed planning, reframing research as a progressive process. Empirical results show that Re-TRAC consistently outperforms ReAct by 15-20% on BrowseComp with frontier LLMs. For smaller models, we introduce Re-TRAC-aware supervised fine-tuning, achieving state-of-the-art performance at comparable scales. Notably, Re-TRAC shows a monotonic reduction in tool calls and token usage across rounds, indicating progressively targeted exploration driven by cross-trajectory reflection rather than redundant search.

RE-TRAC: Recursieve TRAjectoriecompressie voor Diepgaande Zoekagenten

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Samenvatting

Support