PathFinder: 多段階推論パスをガイドする探索

要旨

大規模言語モデルの最近の進展に伴い、推論チェーンを引き出すための連鎖的思考プロンプティングなどの手法が、推論タスクにおける結果の改善に有効であることが示されています。しかし、複数の推論ステップを必要とするタスクは、最先端のモデルにとっても依然として大きな課題となっています。ビームサーチアルゴリズムに着想を得て、我々はPathFinderを提案します。これは、木探索ベースの推論パス生成アプローチであり、動的デコーディングを統合することで多様な分岐とマルチホップ推論を強化します。これは、様々なサンプリング方法とパラメータによって実現されます。制約付き推論を用いることで、PathFinderは新たな品質制約、枝刈り、探索方法を統合し、生成の効率と品質を向上させます。さらに、候補選択を改善するためのスコアリングとランキング機能を備えています。我々のアプローチは、3つの複雑な算術および常識推論タスクにおいて、競合するベースラインを平均6%上回りました。また、我々のモデルは、長く未見の推論チェーンにもうまく一般化し、大きな分岐係数を伴うビームサーチと同様の複雑さを反映しています。

English

With recent advancements in large language models, methods like chain-of-thought prompting to elicit reasoning chains have been shown to improve results on reasoning tasks. However, tasks that require multiple steps of reasoning still pose significant challenges to state-of-the-art models. Drawing inspiration from the beam search algorithm, we propose PathFinder, a tree-search-based reasoning path generation approach. It enhances diverse branching and multi-hop reasoning through the integration of dynamic decoding, enabled by varying sampling methods and parameters. Using constrained reasoning, PathFinder integrates novel quality constraints, pruning, and exploration methods to enhance the efficiency and the quality of generation. Moreover, it includes scoring and ranking features to improve candidate selection. Our approach outperforms competitive baselines on three complex arithmetic and commonsense reasoning tasks by 6% on average. Our model generalizes well to longer, unseen reasoning chains, reflecting similar complexities to beam search with large branching factors.

PathFinder: 多段階推論パスをガイドする探索

PathFinder: Guided Search over Multi-Step Reasoning Paths

要旨

Support