RE-TRAC: 深層探索エージェントのための再帰的軌跡圧縮

要旨

LLMベースの深層研究エージェントは、主にReActフレームワーク上に構築されている。この線形設計では、以前の状態を再訪したり、代替の探索方向に分岐したり、長い文脈下でグローバルな認識を維持することが困難であり、局所最適解、冗長な探索、非効率な検索を引き起こしやすい。我々はRe-TRACを提案する。これは、各軌道後に構造化された状態表現を生成して証拠、不確実性、失敗、将来の計画を要約し、後続の軌道をこの状態表現に条件付けることで、軌道横断的な探索を行うエージェントフレームワークである。これにより、反復的な反省とグローバルに情報化された計画が可能となり、研究を漸進的プロセスとして再構築する。実験結果では、Re-TRACが最先端LLMを用いたBrowseCompにおいて、一貫してReActを15-20%上回る性能を示した。小型モデルについては、Re-TRACを意識した教師ありファインチューニングを導入し、同等規模で最高水準の性能を達成した。特筆すべきは、Re-TRACがラウンドを重ねるごとにツール呼び出しとトークン使用量を単調減少させており、冗長な検索ではなく軌道横断的な反省に駆動された、次第に焦点化される探索を示唆している点である。

English

LLM-based deep research agents are largely built on the ReAct framework. This linear design makes it difficult to revisit earlier states, branch into alternative search directions, or maintain global awareness under long contexts, often leading to local optima, redundant exploration, and inefficient search. We propose Re-TRAC, an agentic framework that performs cross-trajectory exploration by generating a structured state representation after each trajectory to summarize evidence, uncertainties, failures, and future plans, and conditioning subsequent trajectories on this state representation. This enables iterative reflection and globally informed planning, reframing research as a progressive process. Empirical results show that Re-TRAC consistently outperforms ReAct by 15-20% on BrowseComp with frontier LLMs. For smaller models, we introduce Re-TRAC-aware supervised fine-tuning, achieving state-of-the-art performance at comparable scales. Notably, Re-TRAC shows a monotonic reduction in tool calls and token usage across rounds, indicating progressively targeted exploration driven by cross-trajectory reflection rather than redundant search.

RE-TRAC: 深層探索エージェントのための再帰的軌跡圧縮

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

要旨

Support