PlanGEN: 複雑な問題解決のための計画と推論の軌跡を生成するためのマルチエージェントフレームワーク

要旨

最近のエージェントフレームワークと推論時のアルゴリズムは、生成された計画の検証や単一タスク内のインスタンスの複雑さの変動に対する制約の検証の限界により、複雑な計画問題に苦戦しています。これらのタスクに対する既存の多くの手法は、制約を考慮せずにタスクレベルの検証を行うか、インスタンスレベルの複雑さに適応しない推論時のアルゴリズムを適用します。これらの制限に対処するために、私たちはPlanGENを提案します。これはモデルに依存しない容易にスケーラブルなエージェントフレームワークであり、制約、検証、選択エージェントの3つの主要コンポーネントを備えています。具体的には、我々のアプローチは、制約に誘導された反復的検証を提案し、推論時のアルゴリズム（Best of N、Tree-of-Thought、REBASE）の性能を向上させます。PlanGENフレームワークでは、選択エージェントがインスタンスの複雑さに基づいてアルゴリズムの選択を最適化し、複雑な計画問題に対する適応性を向上させます。実験結果は、最も強力なベースラインに対して著しい改善を示し、NATURAL PLAN（約8%向上）、OlympiadBench（約4%向上）、DocFinQA（約7%向上）、GPQA（約1%向上）で最先端の結果を達成しています。私たちの主要な発見は、制約に誘導された反復的検証が推論時のアルゴリズムを改善し、適応的選択が複雑な計画と推論問題の性能をさらに向上させることを強調しています。

English

Recent agent frameworks and inference-time algorithms often struggle with complex planning problems due to limitations in verifying generated plans or reasoning and varying complexity of instances within a single task. Many existing methods for these tasks either perform task-level verification without considering constraints or apply inference-time algorithms without adapting to instance-level complexity. To address these limitations, we propose PlanGEN, a model-agnostic and easily scalable agent framework with three key components: constraint, verification, and selection agents. Specifically, our approach proposes constraint-guided iterative verification to enhance performance of inference-time algorithms--Best of N, Tree-of-Thought, and REBASE. In PlanGEN framework, the selection agent optimizes algorithm choice based on instance complexity, ensuring better adaptability to complex planning problems. Experimental results demonstrate significant improvements over the strongest baseline across multiple benchmarks, achieving state-of-the-art results on NATURAL PLAN (sim8%uparrow), OlympiadBench (sim4%uparrow), DocFinQA (sim7%uparrow), and GPQA (sim1%uparrow). Our key finding highlights that constraint-guided iterative verification improves inference-time algorithms, and adaptive selection further boosts performance on complex planning and reasoning problems.

PlanGEN: 複雑な問題解決のための計画と推論の軌跡を生成するためのマルチエージェントフレームワーク

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

要旨

Support