AgentFugue: 集団的推論による長期タスクのエージェントスケーリング

要旨

近年、長期的なエージェントタスクの進展は主に、より強力なモデル、優れたツール、効果的な足場掛けを通じて個々のエージェントをスケールアップすることによって推進されてきた。対照的に、スケールアウト（同じタスクを対象とする複数のピアエージェントが、明示的な役割の専門化やワークフローのオーケストレーションに依存せずに、追加の能力源となり得るか）については、はるかに理解が進んでいない。我々はこの問題を研究し、共有推論ハブを中核とする集団推論フレームワーク「AgentFugue」を提案する。ピアエージェントが同じタスクを並行して探索する際、ハブは各エージェントが確立、試行、または排除した内容について簡潔なメモを記録し、各エージェントが現在の探索に有用な形式で他のエージェントの発見に選択的にアクセスできるようにする。この設計により、従来は孤立していた軌跡が、中央集権的な計画を必要とせずに、再利用可能な中間推論の相互接続された生態系へと変貌する。我々はハブをプラグイン型の通信層として具体化し、教師ありファインチューニングとエンドツーエンドの強化学習で訓練する。我々が研究する困難な長期的設定において、AgentFugueは強力なベースラインを上回る改善を示す。この結果は、集団推論がピアエージェントシステムのスケールアウトを、単なる計算資源の追加消費ではなく、明確な能力向上の源泉に変え得ることを示唆している。

English

Recent progress on long-horizon agentic tasks has been driven largely by scaling up individual agents through stronger models, better tools, and more effective scaffolding. In contrast, much less is understood about scaling out: whether multiple peer agents, all targeting the same task, can become an additional source of capability without relying on explicit role specialization or workflow orchestration. We study this question and propose AgentFugue, a collective reasoning framework built around a shared reasoning hub. As peer agents explore the same task in parallel, the hub records concise notes on what each agent has established, attempted, or ruled out, and enables each agent to selectively access what other agents have discovered in a form useful for its current search. This design turns otherwise isolated trajectories into a connected ecology of reusable intermediate reasoning without requiring centralized planning. We instantiate the hub as a plug-in communication layer, trained with supervised fine-tuning and end-to-end reinforcement learning. Across the challenging long-horizon settings we study, AgentFugue improves over strong baselines. Our results suggest that collective reasoning can turn scaling out peer agent systems into a distinct source of capability gains, rather than merely a way of spending more compute.