Autodata: 고품질 합성 데이터를 생성하는 에이전트 기반 데이터 과학자

초록

Autodata를 소개합니다. 이는 AI 에이전트가 데이터 과학자 역할을 수행하여 고품질의 훈련 및 평가 데이터를 구축할 수 있도록 하는 일반적인 방법입니다. 우리는 이러한 데이터 과학자 에이전트를 훈련(메타 최적화)하여 더욱 강력한 데이터를 생성하는 방법을 학습시키는 과정을 제시합니다. 전반적인 공식화와 구체적인 실제 구현인 Agentic Self-Instruct를 설명하고, 컴퓨터 과학 연구 과제, 법적 추론 과제, 수학적 객체를 활용한 추론 과제에 대한 실험을 수행합니다. 해당 실험에서 기존의 합성 데이터셋 생성 방법에 비해 개선된 결과를 얻었습니다. 나아가 데이터 과학자 에이전트 자체를 메타 최적화함으로써 더 큰 성능 향상을 달성했습니다. 에이전틱 데이터 생성은 증가된 추론 컴퓨팅 자원을 더 높은 품질의 모델 훈련으로 전환하는 방법을 제공합니다. 전반적으로, 이 방향이 AI 데이터 구축 방식을 변화시킬 잠재력을 가진다고 믿습니다.

English

We introduce Autodata, a general method that enables AI agents to act as data scientists who build high quality training and evaluation data. We show how to train (meta-optimize) such a data scientist agent, so that it learns to create even stronger data. We describe the overall formulation, and a specific practical implementation, Agentic Self-Instruct. We conduct experiments on computer science research tasks, legal reasoning tasks and reasoning with mathematical objects, where we obtain improved results compared to classical synthetic dataset creation methods. Further, meta-optimizing the data scientist agent itself delivers an even larger performance uplift. Agentic data creation provides a way to convert increased inference compute into higher quality model training. Overall, we believe this direction has the potential to change the way we build AI data.