Ontkoppelde planning en uitvoering: Een hiërarchisch redeneerkader voor diepe zoekopdrachten

Samenvatting

Complexe informatiebehoeften in real-world zoekscenario's vereisen diepgaand redeneren en kennissynthese over diverse bronnen, waar traditionele retrieval-augmented generation (RAG) pijplijnen moeite mee hebben om effectief aan te pakken. Huidige op redenering gebaseerde benaderingen kampen met een fundamentele beperking: ze gebruiken één model om zowel hoogwaardige planning als gedetailleerde uitvoering te hanteren, wat leidt tot inefficiënt redeneren en beperkte schaalbaarheid. In dit artikel introduceren we HiRA, een hiërarchisch framework dat strategische planning scheidt van gespecialiseerde uitvoering. Onze aanpak deconstrueert complexe zoektaken in gerichte subtaken, wijst elke subtask toe aan domeinspecifieke agents die zijn uitgerust met externe tools en redeneervermogen, en coördineert de resultaten via een gestructureerd integratiemechanisme. Deze scheiding voorkomt dat uitvoeringsdetails het hoogwaardige redeneren verstoren, terwijl het systeem gespecialiseerde expertise kan benutten voor verschillende soorten informatieverwerking. Experimenten op vier complexe, cross-modale diepe zoekbenchmarks tonen aan dat HiRA aanzienlijk beter presteert dan state-of-the-art RAG en agent-gebaseerde systemen. Onze resultaten laten verbeteringen zien in zowel antwoordkwaliteit als systeemefficiëntie, wat de effectiviteit van ontkoppelde planning en uitvoering voor meerstaps informatiezoektaken benadrukt. Onze code is beschikbaar op https://github.com/ignorejjj/HiRA.

English

Complex information needs in real-world search scenarios demand deep reasoning and knowledge synthesis across diverse sources, which traditional retrieval-augmented generation (RAG) pipelines struggle to address effectively. Current reasoning-based approaches suffer from a fundamental limitation: they use a single model to handle both high-level planning and detailed execution, leading to inefficient reasoning and limited scalability. In this paper, we introduce HiRA, a hierarchical framework that separates strategic planning from specialized execution. Our approach decomposes complex search tasks into focused subtasks, assigns each subtask to domain-specific agents equipped with external tools and reasoning capabilities, and coordinates the results through a structured integration mechanism. This separation prevents execution details from disrupting high-level reasoning while enabling the system to leverage specialized expertise for different types of information processing. Experiments on four complex, cross-modal deep search benchmarks demonstrate that HiRA significantly outperforms state-of-the-art RAG and agent-based systems. Our results show improvements in both answer quality and system efficiency, highlighting the effectiveness of decoupled planning and execution for multi-step information seeking tasks. Our code is available at https://github.com/ignorejjj/HiRA.

Ontkoppelde planning en uitvoering: Een hiërarchisch redeneerkader voor diepe zoekopdrachten

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Samenvatting

Support