MiroThinker-1.7 & H1: 검증을 통한 고성능 연구 에이전트 개발

초록

저희는 복잡한 장기 추론 과제를 위해 설계된 새로운 연구 에이전트인 MiroThinker-1.7을 소개합니다. 이를 기반으로, 더욱 신뢰할 수 있는 다단계 문제 해결을 위한 고성능 추론 능력을 갖춘 MiroThinker-H1을 추가로 선보입니다. 특히 MiroThinker-1.7은 구조화된 계획 수립, 맥락적 추론, 도구 상호작용을 강조하는 에이전트 중간 훈련 단계를 통해 각 상호작용 단계의 신뢰도를 향상시킵니다. 이를 통해 복잡한 작업에 걸쳐 더 효과적인 다단계 상호작용과 지속적 추론이 가능해집니다. MiroThinker-H1은 추론 과정에 지역적 및 전역적 수준에서 검증 기능을 직접 통합합니다. 추론 과정에서 중간 추론 결정을 평가하고 개선할 수 있으며, 전반적인 추론 궤적을 검토하여 최종 답변이 일관된 증거 사슬에 의해 뒷받침되도록 합니다. 오픈 웹 연구, 과학적 추론, 금융 분석을 아우르는 벤치마크에서 MiroThinker-H1은 특화된 영역에서도 강력한 성능을 유지하면서 심층 연구 과제에서 최첨단 성능을 달성했습니다. 또한 MiroThinker-1.7과 MiroThinker-1.7-mini를 오픈소스 모델로 공개하여 경쟁력 있는 연구 에이전트 능력과 크게 향상된 효율성을 제공합니다.

English

We present MiroThinker-1.7, a new research agent designed for complex long-horizon reasoning tasks. Building on this foundation, we further introduce MiroThinker-H1, which extends the agent with heavy-duty reasoning capabilities for more reliable multi-step problem solving. In particular, MiroThinker-1.7 improves the reliability of each interaction step through an agentic mid-training stage that emphasizes structured planning, contextual reasoning, and tool interaction. This enables more effective multi-step interaction and sustained reasoning across complex tasks. MiroThinker-H1 further incorporates verification directly into the reasoning process at both local and global levels. Intermediate reasoning decisions can be evaluated and refined during inference, while the overall reasoning trajectory is audited to ensure that final answers are supported by coherent chains of evidence. Across benchmarks covering open-web research, scientific reasoning, and financial analysis, MiroThinker-H1 achieves state-of-the-art performance on deep research tasks while maintaining strong results on specialized domains. We also release MiroThinker-1.7 and MiroThinker-1.7-mini as open-source models, providing competitive research-agent capabilities with significantly improved efficiency.

MiroThinker-1.7 & H1: 검증을 통한 고성능 연구 에이전트 개발

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

초록

Support