Hermes 3 Technical Report
August 15, 2024
Authors: Ryan Teknium, Jeffrey Quesnelle, Chen Guang
cs.AI
Abstract
Instruct (or "chat") tuned models have become the primary way in which most
people interact with large language models. As opposed to "base" or
"foundation" models, instruct-tuned models are optimized to respond to
imperative statements. We present Hermes 3, a neutrally-aligned generalist
instruct and tool use model with strong reasoning and creative abilities. Its
largest version, Hermes 3 405B, achieves state-of-the-art performance among
open-weight models on several public benchmarks.