ToolRosetta: 자동화된 도구 표준화를 통해 오픈소스 저장소와 대규모 언어 모델 에이전트 연결하기

초록

기존 코드의 재사용과 호출은 여전히 비용이 많이 들고 신뢰하기 어려운 실정인데, 이는 대부분의 실용적인 도구들이 이기종 코드 저장소에 내재되어 있고 표준화된 실행 가능한 인터페이스가 부족하기 때문입니다. 대규모 언어 모델(LLM)과 Model Context Protocol(MCP) 기반 도구 호출 프레임워크가 자연어 작업 실행을 가능하게 하지만, 현재의 접근 방식은 확장성을 근본적으로 제한하는 수동 도구 선별 및 표준화에 크게 의존하고 있습니다. 본 논문에서는 오픈소스 코드 저장소와 API를 LLM이 안정적으로 호출할 수 있는 MCP 호환 도구로 자동 변환하는 통합 프레임워크인 ToolRosetta를 제안합니다. ToolRosetta는 사용자 작업이 주어지면 도구 체인을 자율적으로 계획하고 관련 코드베이스를 식별한 후 이를 실행 가능한 MCP 서비스로 변환하여 최소한의 인간 개입으로 종단간 작업 완성을 가능하게 합니다. 또한 ToolRosetta는 임의 코드 실행에 내재된 위험을 완화하기 위한 보안 검사 계층을 포함합니다. 다양한 과학 분야에 걸친 폭넓은 실험을 통해 ToolRosetta가 대량의 오픈소스 도구를 자동으로 표준화하고 코드 재현 및 배포에 필요한 인간의 노력을 줄일 수 있음을 입증했습니다. 특히, 특화된 오픈소스 도구를 원활하게 활용함으로써 ToolRosetta 기반 에이전트는 상용 LLM 및 기존 에이전트 시스템과 비교해 작업 완성 성능을 지속적으로 향상시킵니다.

English

Reusing and invoking existing code remains costly and unreliable, as most practical tools are embedded in heterogeneous code repositories and lack standardized, executable interfaces. Although large language models (LLMs) and Model Context Protocol (MCP)-based tool invocation frameworks enable natural language task execution, current approaches rely heavily on manual tool curation and standardization, which fundamentally limits scalability. In this paper, we propose ToolRosetta, a unified framework that automatically translates open-source code repositories and APIs into MCP-compatible tools that can be reliably invoked by LLMs. Given a user task, ToolRosetta autonomously plans toolchains, identifies relevant codebases, and converts them into executable MCP services, enabling end-to-end task completion with minimal human intervention. In addition, ToolRosetta incorporates a security inspection layer to mitigate risks inherent in executing arbitrary code. Extensive experiments across diverse scientific domains demonstrate that ToolRosetta can automatically standardize a large number of open-source tools and reduce the human effort required for code reproduction and deployment. Notably, by seamlessly leveraging specialized open-source tools, ToolRosetta-powered agents consistently improve task completion performance compared to commercial LLMs and existing agent systems.

ToolRosetta: 자동화된 도구 표준화를 통해 오픈소스 저장소와 대규모 언어 모델 에이전트 연결하기

ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization

초록

Support