意味的類似性を超えて：直接コーパス相互作用によるエージェント検索のための検索手法の再考

要旨

現代の検索システムは、語彙的であれ意味的であれ、コーパスを単一のトップk検索ステップに圧縮する固定的な類似性インターフェースを通じて公開しています。この抽象化は効率的ですが、エージェント型検索においてはボトルネックとなります。厳密な語彙的制約、疎な手がかりの結合、局所的な文脈チェック、多段階の仮説精緻化は、従来の既製検索システムを呼び出すだけでは実装が困難であり、早期に除外された証拠は強力な下流の推論によって回復できません。エージェント型タスクはこの制限をさらに悪化させます。なぜなら、エージェントは中間エンティティの発見、弱い手がかりの組み合わせ、部分的な証拠観測後の計画修正など、複数のステップを調整する必要があるためです。この制限に対処するため、私たちは直接コーパスインタラクション（DCI）を研究します。これは、エージェントが埋め込みモデル、ベクトルインデックス、検索APIを一切使用せず、汎用ターミナルツール（grep、ファイル読み込み、シェルコマンド、軽量スクリプトなど）で生のコーパスを直接検索する手法です。このアプローチはオフラインインデックス作成を必要とせず、進化するローカルコーパスに自然に適応します。IRベンチマークとエンドツーエンドのエージェント型検索タスクにおいて、このシンプルな設定は複数のBRIGHTおよびBEIRデータセットで強力なスパース、デンス、リランキングベースラインを大幅に上回り、従来の意味的検索器に依存せずにBrowseComp-PlusおよびマルチホップQAで高い精度を達成します。私たちの結果は、言語エージェントが強力になるにつれて、検索品質が推論能力だけでなく、モデルがコーパスと相互作用するインターフェースの解像度にも依存することを示しています。DCIはこのようなエージェント型検索のための、より広範なインターフェース設計空間を開くものです。

English

Modern retrieval systems, whether lexical or semantic, expose a corpus through a fixed similarity interface that compresses access into a single top-k retrieval step before reasoning. This abstraction is efficient, but for agentic search, it becomes a bottleneck: exact lexical constraints, sparse clue conjunctions, local context checks, and multi-step hypothesis refinement are difficult to implement by calling a conventional off-the-shelf retriever, and evidence filtered out early cannot be recovered by stronger downstream reasoning. Agentic tasks further exacerbate this limitation because they require agents to orchestrate multiple steps, including discovering intermediate entities, combining weak clues, and revising the plan after observing partial evidence. To tackle the limitation, we study direct corpus interaction (DCI), where an agent searches the raw corpus directly with general-purpose terminal tools (e.g., grep, file reads, shell commands, lightweight scripts), without any embedding model, vector index, or retrieval API. This approach requires no offline indexing and adapts naturally to evolving local corpora. Across IR benchmarks and end-to-end agentic search tasks, this simple setup substantially outperforms strong sparse, dense, and reranking baselines on several BRIGHT and BEIR datasets, and attains strong accuracy on BrowseComp-Plus and multi-hop QA without relying on any conventional semantic retriever. Our results indicate that as language agents become stronger, retrieval quality depends not only on reasoning ability but also on the resolution of the interface through which the model interacts with the corpus, with which DCI opens a broader interface-design space for agentic search.

意味的類似性を超えて：直接コーパス相互作用によるエージェント検索のための検索手法の再考

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

要旨

Support