효율적인 에이전트를 향하여: 메모리, 도구 학습, 계획

초록

최근 대규모 언어 모델을 에이전트 시스템으로 확장하려는 관심이 높아지고 있습니다. 에이전트의 효과성은 지속적으로 향상되고 있으나, 실제 현장 적용에 중요한 효율성 측면은 종종 간과되어 왔습니다. 따라서 본 논문은 에이전트의 세 가지 핵심 구성 요소인 메모리, 도구 학습, 계획 수립 측면에서 지연 시간, 토큰 수, 단계 수 등의 비용을 고려하여 효율성을 분석합니다. 에이전트 시스템 자체의 효율성을 포괄적으로 연구하기 위해, 구현 방식은 다르지만 압축 및 관리를 통한 문맥 범위 제한, 도구 호출 최소화를 위한 강화 학습 보상 설계, 효율성 향상을 위한 제어된 탐색 메커니즘 적용 등 높은 수준의 공통 원칙을 공유하는 다양한 최신 접근법을 검토하고 상세히 논의합니다. 이에 따라 우리는 효율성을 두 가지 상보적인 방식으로 규정합니다: 고정된 비용 예산 내에서 효과성을 비교하는 방식과 유사한 효과성 수준에서 비용을 비교하는 방식입니다. 이러한 절충 관계는 효과성과 비용 간 파레토 최적선 관점에서도 살펴볼 수 있습니다. 이러한 관점에서 우리는 각 구성 요소에 대한 평가 프로토콜을 종합하고 벤치마크 및 방법론 연구에서 일반적으로 보고되는 효율성 지표를 통합하여 효율성 중심 벤치마크를 분석합니다. 더 나아가 주요 과제와 미래 방향을 논의함으로써 유용한 통찰을 제공하는 것을 목표로 합니다.

English

Recent years have witnessed increasing interest in extending large language models into agentic systems. While the effectiveness of agents has continued to improve, efficiency, which is crucial for real-world deployment, has often been overlooked. This paper therefore investigates efficiency from three core components of agents: memory, tool learning, and planning, considering costs such as latency, tokens, steps, etc. Aimed at conducting comprehensive research addressing the efficiency of the agentic system itself, we review a broad range of recent approaches that differ in implementation yet frequently converge on shared high-level principles including but not limited to bounding context via compression and management, designing reinforcement learning rewards to minimize tool invocation, and employing controlled search mechanisms to enhance efficiency, which we discuss in detail. Accordingly, we characterize efficiency in two complementary ways: comparing effectiveness under a fixed cost budget, and comparing cost at a comparable level of effectiveness. This trade-off can also be viewed through the Pareto frontier between effectiveness and cost. From this perspective, we also examine efficiency oriented benchmarks by summarizing evaluation protocols for these components and consolidating commonly reported efficiency metrics from both benchmark and methodological studies. Moreover, we discuss the key challenges and future directions, with the goal of providing promising insights.

효율적인 에이전트를 향하여: 메모리, 도구 학습, 계획

Toward Efficient Agents: Memory, Tool learning, and Planning

초록

Support