OpenResearcher: 長期的な深層研究軌跡合成のための完全オープンパイプライン

要旨

深層研究エージェントの訓練には、検索、証拠の集約、多段階推論を組み合わせた長期的な軌跡が必要です。しかし、既存のデータ収集パイプラインは一般的に独自のWeb APIに依存しており、大規模な軌跡合成はコストが高く、不安定で、再現性に欠けるという課題があります。本研究ではOpenResearcherを提案します。これは再現性のあるパイプラインであり、一度限りのコーパス構築と多対話軌跡合成を分離し、検索・閲覧ループを1500万ドキュメントのコーパス上で3つの明示的なブラウザ基本操作（検索、開く、検索）を用いて完全オフラインで実行します。GPT-OSS-120Bを教師モデルとして使用し、10万回以上のツール呼び出しを含む長期的な軌跡を多数含む9万7千以上の軌跡を合成しました。これらの軌跡で30B-A3Bバックボーンを教師ありファインチューニングした結果、BrowseComp-Plusで54.8%の精度を達成し、ベースモデル比+34.0ポイントの改善を示しました。同時にBrowseComp、GAIA、xbench-DeepSearchでも競争力のある性能を維持しています。環境がオフラインかつ完全に計装されているため、制御された分析も可能であり、データフィルタリング戦略、エージェント設定の選択肢、検索成功率と最終回答精度の関係など、深層研究パイプライン設計に関する実用的な知見が得られました。パイプライン、合成軌跡、モデルチェックポイント、オフライン検索環境はhttps://github.com/TIGER-AI-Lab/OpenResearcherで公開しています。

English

Training deep research agents requires long-horizon trajectories that interleave search, evidence aggregation, and multi-step reasoning. However, existing data collection pipelines typically rely on proprietary web APIs, making large-scale trajectory synthesis costly, unstable, and difficult to reproduce. We present OpenResearcher, a reproducible pipeline that decouples one-time corpus bootstrapping from multi-turn trajectory synthesis and executes the search-and-browse loop entirely offline using three explicit browser primitives: search, open, and find, over a 15M-document corpus. Using GPT-OSS-120B as the teacher model, we synthesize over 97K trajectories, including a substantial long-horizon tail with 100+ tool calls. Supervised fine-tuning a 30B-A3B backbone on these trajectories achieves 54.8\% accuracy on BrowseComp-Plus, a +34.0 point improvement over the base model, while remaining competitive on BrowseComp, GAIA, and xbench-DeepSearch. Because the environment is offline and fully instrumented, it also enables controlled analysis, where our study reveals practical insights into deep research pipeline design, including data filtering strategies, agent configuration choices, and how retrieval success relates to final answer accuracy. We release the pipeline, synthesized trajectories, model checkpoints, and the offline search environment at https://github.com/TIGER-AI-Lab/OpenResearcher.

OpenResearcher: 長期的な深層研究軌跡合成のための完全オープンパイプライン

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

要旨

Support