Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B 技術レポート

要旨

私たちは、セキュリティ分野初のオープンソースネイティブ推論モデルであるFoundation-Sec-8B-Reasoningを発表します。以前リリースしたFoundation-Sec-8Bベースモデル（Llama-3.1-8B-Base由来）を基盤とし、教師ありファインチューニング（SFT）と検証可能な報酬からの強化学習（RLVR）を組み合わせた2段階のプロセスで学習されました。学習には、セキュリティ分析、指示追従、数学的推論にわたる独自の推論データを活用しています。10のセキュリティベンチマークと10の汎用ベンチマークによる評価では、セキュリティタスクにおいて大幅に大規模なモデルと競合する性能を示しつつ、強力な汎用能力を維持していることが実証されました。本モデルは、マルチホップ推論タスクでの効果的な一般化と、適切なシステムプロンプトとガードレールを導入した際の優れた安全性性能を示します。この成果は、ドメイン特化型の推論モデルが、専門タスクで強力な性能を発揮しつつ、広範な汎用能力を維持できることを実証しています。本モデルはhttps://huggingface.co/fdtn-ai/Foundation-Sec-8B-Reasoning で公開しています。

English

We present Foundation-Sec-8B-Reasoning, the first open-source native reasoning model for cybersecurity. Built upon our previously released Foundation-Sec-8B base model (derived from Llama-3.1-8B-Base), the model is trained through a two-stage process combining supervised fine-tuning (SFT) and reinforcement learning from verifiable rewards (RLVR). Our training leverages proprietary reasoning data spanning cybersecurity analysis, instruction-following, and mathematical reasoning. Evaluation across 10 cybersecurity benchmarks and 10 general-purpose benchmarks demonstrates performance competitive with significantly larger models on cybersecurity tasks while maintaining strong general capabilities. The model shows effective generalization on multi-hop reasoning tasks and strong safety performance when deployed with appropriate system prompts and guardrails. This work demonstrates that domain-specialized reasoning models can achieve strong performance on specialized tasks while maintaining broad general capabilities. We release the model publicly at https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Reasoning.

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B 技術レポート

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

要旨

Support