AgentDropoutV2：テスト時における修正または拒否による枝刈りを介したマルチエージェントシステムの情報フロー最適化

要旨

マルチエージェントシステム（MAS）は複雑な推論において優れた性能を発揮する一方で、個々の参加者が生成する誤った情報の連鎖的影響に悩まされている。現在の解決策は、厳格な構造設計や高コストなファインチューニングに依存することが多く、展開性と適応性が制限されている。本論文では、再学習を必要とせずにMASの情報フローを動的に最適化するテストタイム修正・棄却プルーニングフレームワーク「AgentDropoutV2」を提案する。本手法は能動的なファイアウォールとして機能し、エージェントの出力を傍受した上で、検索拡張型修正器を用いて失敗駆動型指標プールに基づき誤りを反復的に修正する。このメカニズムにより、蒸留された失敗パターンを事前知識として活用し、潜在的な誤りを精密に特定することが可能となる。修復不能な出力は直ちに刈り込まれ誤りの伝播を防止するとともに、フォールバック戦略によりシステムの完全性が維持される。大規模な数学ベンチマークにおける実験結果から、AgentDropoutV2がMASのタスク性能を大幅に向上させ、数学ベンチマークにおいて平均6.3ポイントの精度向上を達成することが示された。さらに本システムは、タスク難易度に基づいて修正努力量を動的に調整し、文脈を意識した指標を活用して多様な誤りパターンを解決する、強力な一般化性能と適応性を発揮する。コードとデータセットはhttps://github.com/TonySY2/AgentDropoutV2で公開している。

English

While Multi-Agent Systems (MAS) excel in complex reasoning, they suffer from the cascading impact of erroneous information generated by individual participants. Current solutions often resort to rigid structural engineering or expensive fine-tuning, limiting their deployability and adaptability. We propose AgentDropoutV2, a test-time rectify-or-reject pruning framework designed to dynamically optimize MAS information flow without retraining. Our approach acts as an active firewall, intercepting agent outputs and employing a retrieval-augmented rectifier to iteratively correct errors based on a failure-driven indicator pool. This mechanism allows for the precise identification of potential errors using distilled failure patterns as prior knowledge. Irreparable outputs are subsequently pruned to prevent error propagation, while a fallback strategy preserves system integrity. Empirical results on extensive math benchmarks show that AgentDropoutV2 significantly boosts the MAS's task performance, achieving an average accuracy gain of 6.3 percentage points on math benchmarks. Furthermore, the system exhibits robust generalization and adaptivity, dynamically modulating rectification efforts based on task difficulty while leveraging context-aware indicators to resolve a wide spectrum of error patterns. Our code and dataset are released at https://github.com/TonySY2/AgentDropoutV2.

AgentDropoutV2：テスト時における修正または拒否による枝刈りを介したマルチエージェントシステムの情報フロー最適化

AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

要旨

Support