Hermes 4 技術報告

摘要

我們推出Hermes 4，這是一系列結合結構化多輪推理與廣泛指令遵循能力的混合推理模型。我們詳細描述了在數據整理、合成、訓練和評估過程中遇到的挑戰，並概述了為大規模解決這些挑戰所採用的方案。我們在數學推理、編程、知識、理解及對齊基準上進行了全面評估，並報告了量化性能與質化行為分析。為支持開放研究，所有模型權重已公開發佈於https://huggingface.co/collections/NousResearch/hermes-4-collection-68a731bfd452e20816725728。

English

We present Hermes 4, a family of hybrid reasoning models that combine structured, multi-turn reasoning with broad instruction-following ability. We describe the challenges encountered during data curation, synthesis, training, and evaluation, and outline the solutions employed to address these challenges at scale. We comprehensively evaluate across mathematical reasoning, coding, knowledge, comprehension, and alignment benchmarks, and we report both quantitative performance and qualitative behavioral analysis. To support open research, all model weights are published publicly at https://huggingface.co/collections/NousResearch/hermes-4-collection-68a731bfd452e20816725728