ChatPaper.aiChatPaper

Hermes 3 技术报告

Hermes 3 Technical Report

August 15, 2024
作者: Ryan Teknium, Jeffrey Quesnelle, Chen Guang
cs.AI

摘要

指导(或“聊天”)微调模型已成为大多数人与大型语言模型互动的主要方式。与“基础”或“基础”模型相反,指导微调模型被优化以响应命令性语句。我们介绍Hermes 3,一个中立对齐的通用指导和工具使用模型,具有强大的推理和创造能力。其最大版本Hermes 3 405B 在几个公共基准测试中实现了开放权重模型的最新性能。
English
Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.

Summary

AI-Generated Summary

PDF538November 16, 2024