AutoMind: 자동화 데이터 과학을 위한 적응형 지식 기반 에이전트

초록

대형 언어 모델(LLM) 에이전트는 현실 세계의 데이터 과학 문제를 해결하는 데 있어 큰 잠재력을 보여주고 있습니다. LLM 기반 데이터 과학 에이전트는 전체 머신러닝 파이프라인을 자동화할 가능성을 제시하지만, 실제 환경에서의 효과성은 여전히 제한적입니다. 기존 프레임워크는 경직된 사전 정의된 워크플로우와 유연성이 부족한 코딩 전략에 의존하고 있으며, 이로 인해 비교적 단순하고 전통적인 문제에서는 뛰어난 성능을 보이지만, 복잡하고 혁신적인 작업에서 인간 전문가가 제공하는 경험적 전문성을 충분히 반영하지 못합니다. 본 연구에서는 이러한 한계를 극복하기 위해 세 가지 주요 혁신을 도입한 적응형 지식 기반 LLM 에이전트 프레임워크인 AutoMind를 소개합니다: (1) 도메인 전문가 지식을 기반으로 에이전트를 지원하는 선별된 전문 지식 베이스, (2) 가능한 솔루션을 전략적으로 탐색하는 에이전트 기반 지식 트리 탐색 알고리즘, (3) 작업 복잡도에 따라 동적으로 코드 생성을 조정하는 자체 적응형 코딩 전략. 두 가지 자동화된 데이터 과학 벤치마크에서의 평가 결과, AutoMind는 최신 베이스라인 대비 우수한 성능을 보여주었습니다. 추가 분석을 통해 효과성, 효율성, 그리고 질적 솔루션 품질에서 유리한 결과를 확인하였으며, 이는 AutoMind가 완전 자동화된 데이터 과학을 향한 효율적이고 견고한 한 걸음임을 강조합니다.

English

Large Language Model (LLM) agents have shown great potential in addressing real-world data science problems. LLM-driven data science agents promise to automate the entire machine learning pipeline, yet their real-world effectiveness remains limited. Existing frameworks depend on rigid, pre-defined workflows and inflexible coding strategies; consequently, they excel only on relatively simple, classical problems and fail to capture the empirical expertise that human practitioners bring to complex, innovative tasks. In this work, we introduce AutoMind, an adaptive, knowledgeable LLM-agent framework that overcomes these deficiencies through three key advances: (1) a curated expert knowledge base that grounds the agent in domain expert knowledge, (2) an agentic knowledgeable tree search algorithm that strategically explores possible solutions, and (3) a self-adaptive coding strategy that dynamically tailors code generation to task complexity. Evaluations on two automated data science benchmarks demonstrate that AutoMind delivers superior performance versus state-of-the-art baselines. Additional analyses confirm favorable effectiveness, efficiency, and qualitative solution quality, highlighting AutoMind as an efficient and robust step toward fully automated data science.

AutoMind: 자동화 데이터 과학을 위한 적응형 지식 기반 에이전트

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

초록

Support