WebResearcher: 長期的な視野を持つエージェントにおける無制限の推論能力の解放

要旨

深層研究システムの最近の進展により、AIエージェントが外部ソースから自律的に知識を発見し統合する可能性が示されています。本論文では、そのようなエージェントを構築するための新しいフレームワークであるWebResearcherを紹介します。このフレームワークは2つの主要なコンポーネントで構成されています：(1) WebResearcherは、深層研究をマルコフ決定過程として再定式化する反復的深層研究パラダイムであり、エージェントが定期的に発見を進化するレポートに統合しながら、焦点を絞ったワークスペースを維持することで、既存の単一コンテキストアプローチを悩ませるコンテキストの窒息やノイズ汚染を克服します。(2) WebFrontierは、ツールを活用した複雑性のエスカレーションを通じて高品質なトレーニングデータを生成するスケーラブルなデータ合成エンジンであり、受動的な知識の想起と能動的な知識の構築の間のギャップを埋める研究タスクの体系的作成を可能にします。特に、我々のパラダイムから得られたトレーニングデータは、従来の単一コンテキスト手法のツール使用能力も大幅に向上させることが分かりました。さらに、我々のパラダイムは並列思考を通じて自然にスケールし、より包括的な結論を得るための並列マルチエージェント探索を可能にします。6つの挑戦的なベンチマークでの広範な実験により、WebResearcherが最先端の性能を達成し、フロンティアのプロプライエタリシステムさえも凌駕することが実証されました。

English

Recent advances in deep-research systems have demonstrated the potential for AI agents to autonomously discover and synthesize knowledge from external sources. In this paper, we introduce WebResearcher, a novel framework for building such agents through two key components: (1) WebResearcher, an iterative deep-research paradigm that reformulates deep research as a Markov Decision Process, where agents periodically consolidate findings into evolving reports while maintaining focused workspaces, overcoming the context suffocation and noise contamination that plague existing mono-contextual approaches; and (2) WebFrontier, a scalable data synthesis engine that generates high-quality training data through tool-augmented complexity escalation, enabling systematic creation of research tasks that bridge the gap between passive knowledge recall and active knowledge construction. Notably, we find that the training data from our paradigm significantly enhances tool-use capabilities even for traditional mono-contextual methods. Furthermore, our paradigm naturally scales through parallel thinking, enabling concurrent multi-agent exploration for more comprehensive conclusions. Extensive experiments across 6 challenging benchmarks demonstrate that WebResearcher achieves state-of-the-art performance, even surpassing frontier proprietary systems.

WebResearcher: 長期的な視野を持つエージェントにおける無制限の推論能力の解放

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

要旨

Support