SaulLM-54B & SaulLM-141B: 法務ドメインにおけるドメイン適応のスケールアップ

要旨

本論文では、法律分野に特化した2つの大規模言語モデル（LLM）、SaulLM-54BとSaulLM-141Bを紹介する。これらのモデルは、それぞれ540億と1410億のパラメータを備えたMixtralアーキテクチャを基盤としている。SaulLM-54BとSaulLM-141Bの開発は、大規模なドメイン適応に基づいており、以下の3つの戦略に分けられる：(1) 5400億以上の法律関連トークンを含む基本コーパスを用いた継続的な事前学習の活用、(2) 法律分野に特化した指示追従プロトコルの実装、(3) 法律解釈におけるモデル出力と人間の選好の整合。第2および第3のステップでは、合成生成データを統合することで、法律文書の解釈と処理におけるモデルの能力が向上し、LegalBench-Instructにおいて最先端の性能を達成し、従来のオープンソースモデルを凌駕した。本研究は、この規模でのドメイン特化適応に伴うトレードオフを探り、強力なデコーダモデルを用いたドメイン適応に関する将来の研究に役立つ知見を提供する。SaulLM-7Bを基盤として、本研究は法律タスクに適したLLMを生成するためのアプローチを洗練させた。再利用と共同研究を促進するため、SaulLM-54BとSaulLM-141Bの基本版、指示版、および整合版をMITライセンスの下で公開する。

English

In this paper, we introduce SaulLM-54B and SaulLM-141B, two large language models (LLMs) tailored for the legal sector. These models, which feature architectures of 54 billion and 141 billion parameters, respectively, are based on the Mixtral architecture. The development of SaulLM-54B and SaulLM-141B is guided by large-scale domain adaptation, divided into three strategies: (1) the exploitation of continued pretraining involving a base corpus that includes over 540 billion of legal tokens, (2) the implementation of a specialized legal instruction-following protocol, and (3) the alignment of model outputs with human preferences in legal interpretations. The integration of synthetically generated data in the second and third steps enhances the models' capabilities in interpreting and processing legal texts, effectively reaching state-of-the-art performance and outperforming previous open-source models on LegalBench-Instruct. This work explores the trade-offs involved in domain-specific adaptation at this scale, offering insights that may inform future studies on domain adaptation using strong decoder models. Building upon SaulLM-7B, this study refines the approach to produce an LLM better equipped for legal tasks. We are releasing base, instruct, and aligned versions on top of SaulLM-54B and SaulLM-141B under the MIT License to facilitate reuse and collaborative research.

SaulLM-54B & SaulLM-141B: 法務ドメインにおけるドメイン適応のスケールアップ

SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain

要旨

Support