AgentSwing: 장기적 웹 에이전트를 위한 적응형 병렬 컨텍스트 관리 라우팅

초록

대규모 언어 모델(LLM)이 장기적 정보 탐색을 위한 자율 에이전트로 진화함에 따라 유한한 컨텍스트 용량 관리가 중요한 병목 현상으로 대두되고 있습니다. 기존 컨텍스트 관리 방법은 일반적으로 전체 작업 궤적 동안 단일 고정 전략을 고수합니다. 이러한 정적 설계는 특정 상태에서는 효과적일 수 있지만, 장기적 탐색 과정에서 누적된 컨텍스트의 유용성과 신뢰도가 변화함에 따라 적응하지 못하는 한계가 있습니다. 이러한 문제를 정형화하기 위해 우리는 탐색 효율성과 최종 정확도라는 두 가지 상호 보완적인 차원을 통해 장기적 성공을 규정하는 확률론적 프레임워크를 제시합니다. 이러한 관점을 바탕으로 우리는 상태 인식 적응형 병렬 컨텍스트 관리 라우팅 프레임워크인 AgentSwing을 제안합니다. AgentSwing은 각 트리거 포인트에서 여러 컨텍스트 관리 브랜치를 병렬로 확장하고 예측 라우팅을 통해 가장 유망한 후속 경로를 선택합니다. 다양한 벤치마크와 에이전트 백본에서 수행한 실험 결과, AgentSwing이 강력한 정적 컨텍스트 관리 방법들을 지속적으로 능가하며, 최대 3배 적은 상호작용 횟수로 동등하거나 더 높은 성능을 달성함과 동시에 장기적 웹 에이전트의 최종 성능 한계도 향상시키는 것으로 나타났습니다. 실증적 성능 향상 외에도, 제안된 확률론적 프레임워크는 향후 장기적 에이전트를 위한 컨텍스트 관리 전략을 분석하고 설계하는 데 이론적 토대를 제공합니다.

English

As large language models (LLMs) evolve into autonomous agents for long-horizon information-seeking, managing finite context capacity has become a critical bottleneck. Existing context management methods typically commit to a single fixed strategy throughout the entire trajectory. Such static designs may work well in some states, but they cannot adapt as the usefulness and reliability of the accumulated context evolve during long-horizon search. To formalize this challenge, we introduce a probabilistic framework that characterizes long-horizon success through two complementary dimensions: search efficiency and terminal precision. Building on this perspective, we propose AgentSwing, a state-aware adaptive parallel context management routing framework. At each trigger point, AgentSwing expands multiple context-managed branches in parallel and uses lookahead routing to select the most promising continuation. Experiments across diverse benchmarks and agent backbones show that AgentSwing consistently outperforms strong static context management methods, often matching or exceeding their performance with up to 3times fewer interaction turns while also improving the ultimate performance ceiling of long-horizon web agents. Beyond the empirical gains, the proposed probabilistic framework provides a principled lens for analyzing and designing future context management strategies for long-horizon agents.

AgentSwing: 장기적 웹 에이전트를 위한 적응형 병렬 컨텍스트 관리 라우팅

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

초록

Support