추론 경제성 활용: 대규모 언어 모델을 위한 효율적 추론 기술 연구 동향

초록

최근 대규모 언어 모델(LLMs)의 발전은 복잡한 추론 작업 수행 능력을 크게 향상시켰으며, 빠르고 직관적인 사고(시스템 1)에서 느리고 깊은 추론(시스템 2)으로의 전환을 이끌었습니다. 시스템 2 추론은 작업 정확도를 향상시키지만, 느린 사고 방식과 비효율적이거나 불필요한 추론 행동으로 인해 상당한 계산 비용이 발생합니다. 반면, 시스템 1 추론은 계산적으로 효율적이지만 최적의 성능을 내지 못합니다. 따라서 성능(이점)과 계산 비용(예산) 사이의 균형을 맞추는 것이 중요하며, 이로 인해 추론 경제성(reasoning economy)이라는 개념이 등장했습니다. 본 조사에서는 LLMs의 사후 학습 및 테스트 시 추론 단계에서의 추론 경제성을 종합적으로 분석하며, i) 추론 비효율성의 원인, ii) 다양한 추론 패턴의 행동 분석, iii) 추론 경제성을 달성하기 위한 잠재적 해결책을 다룹니다. 실행 가능한 통찰을 제공하고 개방된 과제를 강조함으로써, LLMs의 추론 경제성을 개선하기 위한 전략에 대한 통찰을 제공하고, 이 진화하는 분야의 연구를 발전시키는 데 유용한 자료로 활용될 것을 목표로 합니다. 또한, 이 빠르게 진화하는 분야의 발전을 지속적으로 추적하기 위한 공개 저장소를 제공합니다.

English

Recent advancements in Large Language Models (LLMs) have significantly enhanced their ability to perform complex reasoning tasks, transitioning from fast and intuitive thinking (System 1) to slow and deep reasoning (System 2). While System 2 reasoning improves task accuracy, it often incurs substantial computational costs due to its slow thinking nature and inefficient or unnecessary reasoning behaviors. In contrast, System 1 reasoning is computationally efficient but leads to suboptimal performance. Consequently, it is critical to balance the trade-off between performance (benefits) and computational costs (budgets), giving rise to the concept of reasoning economy. In this survey, we provide a comprehensive analysis of reasoning economy in both the post-training and test-time inference stages of LLMs, encompassing i) the cause of reasoning inefficiency, ii) behavior analysis of different reasoning patterns, and iii) potential solutions to achieve reasoning economy. By offering actionable insights and highlighting open challenges, we aim to shed light on strategies for improving the reasoning economy of LLMs, thereby serving as a valuable resource for advancing research in this evolving area. We also provide a public repository to continually track developments in this fast-evolving field.

추론 경제성 활용: 대규모 언어 모델을 위한 효율적 추론 기술 연구 동향

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

초록

Support