Web2BigTable: インターネット規模の情報検索・抽出のための二段階マルチエージェントLLMシステム

要旨

エージェント型Web検索は現在、単一ターゲットに対する深い推論と、多数のエンティティや多様な情報源にわたる構造化された集約という、2つの異なる要求に直面しています。現行システムは両面で課題を抱えています。広さ志向のタスクは、広範なカバレッジとエンティティ間の一貫性を持つスキーマ整合型の出力を要求し、深さ志向のタスクは、長く分岐する検索軌跡にわたる一貫した推論を必要とします。本論文では、両方の体制をサポートするWeb-to-table検索のためのマルチエージェントフレームワーク「Web2BigTable」を提案します。Web2BigTableは、上位レベルのオーケストレーターがタスクを副問題に分解し、下位レベルのワーカーエージェントがそれらを並列解決する二層アーキテクチャを採用しています。閉ループの「実行―検証―反映」プロセスを通じて、本フレームワークは、永続的で人間可読な外部メモリを介し、各単体エージェントへの自己進化的な更新とともに、分解と実行を時間とともに共同で改善します。実行中、ワーカーは部分的な発見を可視化する共有ワークスペースを通じて調整し、冗長な探索の削減、矛盾する証拠の調整、新たに生じるカバレッジのギャップへの適応を可能にします。Web2BigTableはWideSearchにおいて新たなstate-of-the-artを達成し、Avg@4成功率38.50（2位の5.10の7.5倍）、Row F1 63.53（2位比+25.03）、Item F1 80.12（2位比+14.42）を記録しました。また、XBench-DeepSearchにおける深さ志向検索にも汎化し、73.0%の精度を達成しました。コードはhttps://github.com/web2bigtable/web2bigtable で公開されています。

English

Agentic web search increasingly faces two distinct demands: deep reasoning over a single target, and structured aggregation across many entities and heterogeneous sources. Current systems struggle on both fronts. Breadth-oriented tasks demand schema-aligned outputs with wide coverage and cross-entity consistency, while depth-oriented tasks require coherent reasoning over long, branching search trajectories. We introduce Web2BigTable, a multi-agent framework for web-to-table search that supports both regimes. Web2BigTable adopts a bi-level architecture in which an upper-level orchestrator decomposes the task into sub-problems and lower-level worker agents solve them in parallel. Through a closed-loop run--verify--reflect process, the framework jointly improves decomposition and execution over time via persistent, human-readable external memory, with self-evolving updates to each single-agent. During execution, workers coordinate through a shared workspace that makes partial findings visible, allowing them to reduce redundant exploration, reconcile conflicting evidence, and adapt to emerging coverage gaps. Web2BigTable sets a new state of the art on WideSearch, reaching an Avg@4 Success Rate of 38.50 (7.5times the second best at 5.10), Row F1 of 63.53 (+25.03 over the second best), and Item F1 of 80.12 (+14.42 over the second best). It also generalises to depth-oriented search on XBench-DeepSearch, achieving 73.0 accuracy. Code is available at https://github.com/web2bigtable/web2bigtable.

Web2BigTable: インターネット規模の情報検索・抽出のための二段階マルチエージェントLLMシステム

Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction

要旨

Support