MiroThinker-1.7與H1：基於驗證機制建構重型研究智能體

摘要

我們推出MiroThinker-1.7，這款新型研究智能體專為複雜長程推理任務而設計。在此基礎上，我們進一步推出MiroThinker-H1，通過強化重型推理能力來實現更可靠的多步驟問題解決。特別值得注意的是，MiroThinker-1.7通過強調結構化規劃、情境推理與工具交互的智能體中期訓練階段，提升了每個交互步驟的可靠性。這使得智能體在複雜任務中能實現更有效的多步驟交互與持續推理。MiroThinker-H1更將驗證機制直接整合至局部與全局層面的推理過程中：中間推理決策可在推論時進行評估與優化，同時對整體推理軌跡進行審計，確保最終答案由連貫的證據鏈所支持。在涵蓋開放網絡研究、科學推理與金融分析的基準測試中，MiroThinker-H1在深度研究任務上達成最先進性能，同時在專業領域保持強勁表現。我們同步開源MiroThinker-1.7與MiroThinker-1.7-mini模型，以顯著提升的效率提供極具競爭力的研究智能體能力。

English

We present MiroThinker-1.7, a new research agent designed for complex long-horizon reasoning tasks. Building on this foundation, we further introduce MiroThinker-H1, which extends the agent with heavy-duty reasoning capabilities for more reliable multi-step problem solving. In particular, MiroThinker-1.7 improves the reliability of each interaction step through an agentic mid-training stage that emphasizes structured planning, contextual reasoning, and tool interaction. This enables more effective multi-step interaction and sustained reasoning across complex tasks. MiroThinker-H1 further incorporates verification directly into the reasoning process at both local and global levels. Intermediate reasoning decisions can be evaluated and refined during inference, while the overall reasoning trajectory is audited to ensure that final answers are supported by coherent chains of evidence. Across benchmarks covering open-web research, scientific reasoning, and financial analysis, MiroThinker-H1 achieves state-of-the-art performance on deep research tasks while maintaining strong results on specialized domains. We also release MiroThinker-1.7 and MiroThinker-1.7-mini as open-source models, providing competitive research-agent capabilities with significantly improved efficiency.

MiroThinker-1.7與H1：基於驗證機制建構重型研究智能體

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

摘要

Support