Hermes 3 Technical Report

August 15, 2024
Authors: Ryan Teknium, Jeffrey Quesnelle, Chen Guang
cs.AI

Abstract

Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally aligned generalist instruct and tool-use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state-of-the-art performance among open-weight models on several public benchmarks.
