AgentDropoutV2: 테스트 타임 수정-또는-거절 가지치기를 통한 다중 에이전트 시스템 정보 흐름 최적화

초록

다중 에이전트 시스템(MAS)은 복잡한 추론에서 탁월한 성능을 보이지만, 개별 참가자가 생성한 오류 정보의 연쇄적 영향에 취약합니다. 기존 솔루션은 경직된 구조 설계나 고비용 미세 조정에 의존하는 경우가 많아 배포성과 적응성이 제한됩니다. 본 연구에서는 재학습 없이 MAS 정보 흐름을 동적으로 최적화하기 위해 설계된 테스트 시점 수정-또는-거절 프레임워크인 AgentDropoutV2를 제안합니다. 우리의 접근 방식은 능동형 방화벽처럼 작동하여 에이전트 출력을 차단하고, 검색 기반 수정기를 활용하여 실패 기반 지표 풀을 바탕으로 오류를 반복적으로 수정합니다. 이 메커니즘은 정제된 실패 패턴을 사전 지식으로 활용하여 잠재적 오류를 정확하게 식별할 수 있게 합니다. 수복 불가능한 출력은 오류 전파를 방지하기 위해 제거되며, 폴백 전략을 통해 시스템 무결성을 유지합니다. 다양한 수학 벤치마크에 대한 실험 결과, AgentDropoutV2가 MAS의 작업 성능을 크게 향상시켜 수학 벤치마크에서 평균 6.3%p의 정확도 향상을 달성했음을 보여줍니다. 더 나아가 본 시스템은 과제 난이도에 기반하여 수정 노력을 동적으로 조절하고, 상황 인식 지표를 활용하여 다양한 오류 패턴을 해결함으로써 강력한 일반화 능력과 적응성을 나타냅니다. 우리의 코드와 데이터셋은 https://github.com/TonySY2/AgentDropoutV2에서 공개되었습니다.

English

While Multi-Agent Systems (MAS) excel in complex reasoning, they suffer from the cascading impact of erroneous information generated by individual participants. Current solutions often resort to rigid structural engineering or expensive fine-tuning, limiting their deployability and adaptability. We propose AgentDropoutV2, a test-time rectify-or-reject pruning framework designed to dynamically optimize MAS information flow without retraining. Our approach acts as an active firewall, intercepting agent outputs and employing a retrieval-augmented rectifier to iteratively correct errors based on a failure-driven indicator pool. This mechanism allows for the precise identification of potential errors using distilled failure patterns as prior knowledge. Irreparable outputs are subsequently pruned to prevent error propagation, while a fallback strategy preserves system integrity. Empirical results on extensive math benchmarks show that AgentDropoutV2 significantly boosts the MAS's task performance, achieving an average accuracy gain of 6.3 percentage points on math benchmarks. Furthermore, the system exhibits robust generalization and adaptivity, dynamically modulating rectification efforts based on task difficulty while leveraging context-aware indicators to resolve a wide spectrum of error patterns. Our code and dataset are released at https://github.com/TonySY2/AgentDropoutV2.

AgentDropoutV2: 테스트 타임 수정-또는-거절 가지치기를 통한 다중 에이전트 시스템 정보 흐름 최적화

AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

초록

Support