ChatPaper.aiChatPaper

Llama-3.1-基础AI安全大模型-8B指令版技术报告

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

August 1, 2025
作者: Sajana Weerawardhena, Paul Kassianik, Blaine Nelson, Baturay Saglam, Anu Vellore, Aman Priyanshu, Supriti Vijay, Massimo Aufiero, Arthur Goldblatt, Fraser Burch, Ed Li, Jianliang He, Dhruv Kedia, Kojin Oshiba, Zhouran Yang, Yaron Singer, Amin Karbasi
cs.AI

摘要

大型语言模型(LLMs)在众多领域展现了显著的成功,然而其在网络安全应用中的整合仍显不足,原因在于缺乏通用网络安全数据、表征复杂性以及安全与监管顾虑。为填补这一空白,我们先前推出了Foundation-Sec-8B,这是一款专为网络安全设计、适合下游任务微调的LLM。然而,该模型并未针对聊天式交互或指令遵循进行优化。在本报告中,我们发布了Foundation-Sec-8B-Instruct:一款专门训练用于通用网络安全对话的模型。基于Foundation-Sec-8B构建,它融合了领域专业知识、指令遵循能力、对话技巧及与人类偏好的对齐,以生成高质量、相关性强的响应。全面评估显示,Foundation-Sec-8B-Instruct在一系列网络安全任务上超越了Llama 3.1-8B-Instruct,同时在指令遵循性能上与之相当。在网络安全威胁情报和指令遵循任务上,它也与GPT-4o-mini旗鼓相当。我们预见Foundation-Sec-8B-Instruct将成为网络安全专业人员日常工作中不可或缺的助手。该模型已公开发布于https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Instruct。
English
Large language models (LLMs) have shown remarkable success across many domains, yet their integration into cybersecurity applications remains limited due to a lack of general-purpose cybersecurity data, representational complexity, and safety and regulatory concerns. To address this gap, we previously introduced Foundation-Sec-8B, a cybersecurity-focused LLM suitable for fine-tuning on downstream tasks. That model, however, was not designed for chat-style interactions or instruction-following. In this report, we release Foundation-Sec-8B-Instruct: a model specifically trained for general-purpose cybersecurity dialogue. Built on Foundation-Sec-8B, it combines domain-specific knowledge with instruction-following, conversational capabilities, and alignment with human preferences to produce high-quality, relevant responses. Comprehensive evaluations show that Foundation-Sec-8B-Instruct outperforms Llama 3.1-8B-Instruct on a range of cybersecurity tasks while matching its instruction-following performance. It is also competitive with GPT-4o-mini on cyber threat intelligence and instruction-following tasks. We envision Foundation-Sec-8B-Instruct becoming an indispensable assistant in the daily workflows of cybersecurity professionals. We release the model publicly at https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Instruct.
PDF242August 5, 2025