ChemDFM-R: 원자화된 화학 지식으로 강화된 화학 추론 대형 언어 모델

초록

대규모 언어 모델(LLM)이 인상적인 발전을 이루었음에도 불구하고, 화학과 같은 과학 분야에서의 적용은 얕은 도메인 이해와 제한된 추론 능력으로 인해 여전히 어려움을 겪고 있습니다. 본 연구에서는 화학이라는 특정 분야에 초점을 맞추어 화학 추론 LLM인 ChemDFM-R을 개발합니다. 먼저, 모델이 화학의 기본 원리와 논리적 구조를 더 잘 이해할 수 있도록 원자화된 지식 포인트로 구성된 포괄적인 데이터셋을 구축합니다. 그런 다음, 전문가가 선별한 지식과 일반 도메인 추론 능력을 통합한 혼합 소스 증류 전략을 제안하고, 이를 화학 추론을 강화하기 위한 도메인 특화 강화 학습으로 이어갑니다. 다양한 화학 벤치마크에서의 실험 결과, ChemDFM-R은 최첨단 성능을 달성하면서도 해석 가능하고 근거 기반의 출력을 제공함을 보여줍니다. 추가 사례 연구를 통해 명시적인 추론 체인이 실제 인간-AI 협업 시나리오에서 모델의 신뢰성, 투명성 및 실용성을 크게 향상시키는 방식을 설명합니다.

English

While large language models (LLMs) have achieved impressive progress, their application in scientific domains such as chemistry remains hindered by shallow domain understanding and limited reasoning capabilities. In this work, we focus on the specific field of chemistry and develop a Chemical Reasoner LLM, ChemDFM-R. We first construct a comprehensive dataset of atomized knowledge points to enhance the model's understanding of the fundamental principles and logical structure of chemistry. Then, we propose a mix-sourced distillation strategy that integrates expert-curated knowledge with general-domain reasoning skills, followed by domain-specific reinforcement learning to enhance chemical reasoning. Experiments on diverse chemical benchmarks demonstrate that ChemDFM-R achieves state-of-the-art performance while providing interpretable, rationale-driven outputs. Further case studies illustrate how explicit reasoning chains significantly improve the reliability, transparency, and practical utility of the model in real-world human-AI collaboration scenarios.

ChemDFM-R: 원자화된 화학 지식으로 강화된 화학 추론 대형 언어 모델

ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge

초록

Support