ActionStudio: 大規模アクションモデルのデータとトレーニングのための軽量フレームワーク

要旨

アクションモデルは、自律エージェントが複雑なタスクを実行するために不可欠です。しかし、エージェント環境の多様性やエージェントデータの複雑さにより、大規模なアクションモデルのトレーニングは依然として困難です。関心が高まる中、既存のインフラストラクチャはスケーラブルでエージェント固有のファインチューニングを限定的にしかサポートしていません。本論文では、大規模アクションモデル向けに設計された軽量で拡張可能なデータおよびトレーニングフレームワークであるActionStudioを紹介します。ActionStudioは、標準化されたフォーマットを通じて異種混在のエージェント軌跡を統合し、LoRA、完全なファインチューニング、分散セットアップを含む多様なトレーニングパラダイムをサポートし、堅牢な前処理および検証ツールを統合しています。その有効性を公開ベンチマークと現実的な産業ベンチマークの両方で検証し、優れたパフォーマンスと実用的なスケーラビリティを実証しています。コミュニティの研究を促進するため、コードとデータをhttps://github.com/SalesforceAIResearch/xLAMでオープンソース化しました。

English

Action models are essential for enabling autonomous agents to perform complex tasks. However, training large action models remains challenging due to the diversity of agent environments and the complexity of agentic data. Despite growing interest, existing infrastructure provides limited support for scalable, agent-specific fine-tuning. We present ActionStudio, a lightweight and extensible data and training framework designed for large action models. ActionStudio unifies heterogeneous agent trajectories through a standardized format, supports diverse training paradigms including LoRA, full fine-tuning, and distributed setups, and integrates robust preprocessing and verification tools. We validate its effectiveness across both public and realistic industry benchmarks, demonstrating strong performance and practical scalability. We open-sourced code and data at https://github.com/SalesforceAIResearch/xLAM to facilitate research in the community.

ActionStudio: 大規模アクションモデルのデータとトレーニングのための軽量フレームワーク

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

要旨

Support