Hermes 3 Technical Report

Abstract

Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. Unlike "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally aligned generalist instruct and tool-use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state-of-the-art performance among open-weight models on several public benchmarks.