헤르메스 4 기술 보고서

초록

우리는 구조화된 다중 턴 추론과 광범위한 명령 수행 능력을 결합한 하이브리드 추론 모델 패밀리인 Hermes 4를 소개한다. 데이터 큐레이션, 합성, 훈련 및 평가 과정에서 직면한 과제들을 설명하고, 이러한 과제를 대규모로 해결하기 위해 채택한 솔루션들을 개요한다. 수학적 추론, 코딩, 지식, 이해 및 정렬 벤치마크에 걸쳐 포괄적으로 평가를 수행하며, 양적 성능과 질적 행동 분석을 모두 보고한다. 개방형 연구를 지원하기 위해 모든 모델 가중치는 https://huggingface.co/collections/NousResearch/hermes-4-collection-68a731bfd452e20816725728에서 공개적으로 발표되었다.

English

We present Hermes 4, a family of hybrid reasoning models that combine structured, multi-turn reasoning with broad instruction-following ability. We describe the challenges encountered during data curation, synthesis, training, and evaluation, and outline the solutions employed to address these challenges at scale. We comprehensively evaluate across mathematical reasoning, coding, knowledge, comprehension, and alignment benchmarks, and we report both quantitative performance and qualitative behavioral analysis. To support open research, all model weights are published publicly at https://huggingface.co/collections/NousResearch/hermes-4-collection-68a731bfd452e20816725728

헤르메스 4 기술 보고서

Hermes 4 Technical Report

초록

Support