ARIS: 적대적 다중 에이전트 협업을 통한 자율적 연구

초록

이 보고서는 자율 연구를 위한 오픈소스 연구 하네스인 ARIS(Auto-Research-in-sleep)의 아키텍처, 보증 메커니즘 및 초기 적용 경험을 설명합니다. LLM 기반 에이전트 시스템의 성능은 모델 가중치와 모델에 제공할 정보의 저장, 검색, 제시를 관리하는 하네스 양자에 의존합니다. 장기적인 연구 워크플로우에서 주요 실패 모드는 가시적인 중단이 아니라, 지지 근거가 불완전하거나 잘못 보고되었거나 실행자의 프레이밍에서 암묵적으로 상속된 주장을 생성하는 그럴듯한 비지지 성공입니다. 따라서 본고에서는 기본 구성으로 상호 모델 간 적대적 협업을 통해 머신러닝 연구 워크플로우를 조정하는 연구 하네스로서 ARIS를 제시합니다: 실행자 모델이 진행을 주도하는 동시에 다른 모델 패밀리의 검토자가 중간 산출물을 비판하고 수정을 요청하도록 권장됩니다. ARIS는 세 가지 아키텍처 계층으로 구성됩니다. 실행 계층은 65개 이상의 재사용 가능한 Markdown 정의 스킬, MCP를 통한 모델 통합, 이전 연구 결과의 반복적 재사용을 위한 지속적 연구 위키, 결정론적 그림 생성을 제공합니다. 오케스트레이션 계층은 조정 가능한 노력 설정과 검토자 모델로의 구성 가능한 라우팅을 갖춘 5가지 종단 간 워크플로우를 조정합니다. 보증 계층은 실험적 주장이 증거에 의해 지지되는지 확인하는 3단계 프로세스(무결성 검증, 결과-주장 매핑, 원고 진술을 주장 원장 및 원시 증거와 교차 검증하는 주장 감사)와 5단계 과학 편집 파이프라인, 수학적 증명 검사, 렌더링된 PDF의 시각적 검사를 포함합니다. 프로토타입 자기 개선 루프는 연구 흔적을 기록하고 검토자 승인 후에만 채택되는 하네스 개선안을 제안합니다.

English

This report describes ARIS (Auto-Research-in-sleep), an open-source research harness for autonomous research, including its architecture, assurance mechanisms, and early deployment experience. The performance of agent systems built on LLMs depends on both the model weights and the harness around them, which governs what information to store, retrieve, and present to the model. For long-horizon research workflows, the central failure mode is not a visible breakdown but a plausible unsupported success: a long-running agent can produce claims whose evidential support is incomplete, misreported, or silently inherited from the executor's framing. Therefore, we present ARIS as a research harness that coordinates machine-learning research workflows through cross-model adversarial collaboration as a default configuration: an executor model drives forward progress while a reviewer from a different model family is recommended to critique intermediate artifacts and request revisions. ARIS has three architectural layers. The execution layer provides more than 65 reusable Markdown-defined skills, model integrations via MCP, a persistent research wiki for iterative reuse of prior findings, and deterministic figure generation. The orchestration layer coordinates five end-to-end workflows with adjustable effort settings and configurable routing to reviewer models. The assurance layer includes a three-stage process for checking whether experimental claims are supported by evidence: integrity verification, result-to-claim mapping, and claim auditing that cross-checks manuscript statements against the claim ledger and raw evidence, as well as a five-pass scientific-editing pipeline, mathematical-proof checks, and visual inspection of the rendered PDF. A prototype self-improvement loop records research traces and proposes harness improvements that are adopted only after reviewer approval.

ARIS: 적대적 다중 에이전트 협업을 통한 자율적 연구

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

초록

Support