Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Abstract

Complex information needs in real-world search scenarios demand deep reasoning and knowledge synthesis across diverse sources, which traditional retrieval-augmented generation (RAG) pipelines struggle to address effectively. Current reasoning-based approaches suffer from a fundamental limitation: they use a single model to handle both high-level planning and detailed execution, leading to inefficient reasoning and limited scalability. In this paper, we introduce HiRA, a hierarchical framework that separates strategic planning from specialized execution. Our approach decomposes complex search tasks into focused subtasks, assigns each subtask to domain-specific agents equipped with external tools and reasoning capabilities, and coordinates the results through a structured integration mechanism. This separation prevents execution details from disrupting high-level reasoning while enabling the system to leverage specialized expertise for different types of information processing. Experiments on four complex, cross-modal deep search benchmarks demonstrate that HiRA significantly outperforms state-of-the-art RAG and agent-based systems. Our results show improvements in both answer quality and system efficiency, highlighting the effectiveness of decoupled planning and execution for multi-step information-seeking tasks. Our code is available at https://github.com/ignorejjj/HiRA.
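
The separation described in the abstract (a planner that decomposes the task, domain-specific agents that execute subtasks, and a coordinator that integrates their results) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the names `Subtask`, `plan`, `search_agent`, `compute_agent`, `AGENTS`, and `run` are hypothetical and are not taken from the HiRA codebase, and the hard-coded planner and stub agents stand in for components that would be reasoning models and tool-using agents in a real system.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Subtask:
    domain: str  # which kind of specialized agent should handle it
    query: str   # focused instruction for that agent


def plan(question: str) -> List[Subtask]:
    # Stand-in for the high-level planner. A real planner would be a
    # reasoning model; this decomposition is hard-coded for illustration.
    return [
        Subtask("search", f"find sources for: {question}"),
        Subtask("compute", "2 + 3"),
    ]


def search_agent(query: str) -> str:
    # Hypothetical executor; a real one would call an external search tool.
    return f"[search result for '{query}']"


def compute_agent(query: str) -> str:
    # Hypothetical executor that evaluates a tiny "a + b" expression.
    left, right = query.split("+")
    return str(int(left) + int(right))


# Registry mapping each domain to the agent that executes it.
AGENTS: Dict[str, Callable[[str], str]] = {
    "search": search_agent,
    "compute": compute_agent,
}


def run(question: str) -> str:
    # Coordinator: route each subtask to its agent and keep only the
    # returned results, so execution details never enter the plan itself.
    results = [AGENTS[task.domain](task.query) for task in plan(question)]
    return " | ".join(results)
```

In this sketch the planner only ever sees subtask descriptions and final results, which is the property the abstract attributes to decoupling; how HiRA actually structures its integration mechanism is not specified here.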
