무엇에서 왜로: 증거 기반 화학 반응 조건 추론을 위한 다중 에이전트 시스템

초록

화학 반응 조건 추천은 화학 반응에 적절한 조건 파라미터를 선택하는 것으로, 화학 과학의 가속화에 있어 핵심적인 역할을 합니다. 대규모 언어 모델(LLMs)의 급속한 발전과 함께, 이러한 모델의 추론 및 계획 능력을 활용하여 반응 조건을 추천하려는 관심이 증가하고 있습니다. 기존 방법들은 성공을 거두었음에도 불구하고, 추천된 반응 조건의 근거를 거의 설명하지 않아, 고위험 과학 워크플로에서의 유용성이 제한적이었습니다. 본 연구에서는 조건 예측을 증거 기반 추론 작업으로 재구성한 다중 에이전트 시스템인 ChemMAS를 제안합니다. ChemMAS는 이 작업을 메커니즘 기반 근거 확립, 다중 채널 정보 회상, 제약 조건 인식 에이전트 토론, 그리고 근거 통합으로 분해합니다. 각 결정은 화학 지식과 검색된 선례에 기반한 해석 가능한 근거로 뒷받침됩니다. 실험 결과, ChemMAS는 도메인 특화 베이스라인 대비 20-35%의 성능 향상을 달성했으며, 범용 LLMs보다 Top-1 정확도에서 10-15% 우수한 성능을 보였습니다. 또한, 반증 가능하고 인간이 신뢰할 수 있는 근거를 제공함으로써, 과학적 발견에서 설명 가능한 AI의 새로운 패러다임을 확립했습니다.

English

The chemical reaction recommendation is to select proper reaction condition parameters for chemical reactions, which is pivotal to accelerating chemical science. With the rapid development of large language models (LLMs), there is growing interest in leveraging their reasoning and planning capabilities for reaction condition recommendation. Despite their success, existing methods rarely explain the rationale behind the recommended reaction conditions, limiting their utility in high-stakes scientific workflows. In this work, we propose ChemMAS, a multi-agent system that reframes condition prediction as an evidence-based reasoning task. ChemMAS decomposes the task into mechanistic grounding, multi-channel recall, constraint-aware agentic debate, and rationale aggregation. Each decision is backed by interpretable justifications grounded in chemical knowledge and retrieved precedents. Experiments show that ChemMAS achieves 20-35% gains over domain-specific baselines and outperforms general-purpose LLMs by 10-15% in Top-1 accuracy, while offering falsifiable, human-trustable rationales, which establishes a new paradigm for explainable AI in scientific discovery.

무엇에서 왜로: 증거 기반 화학 반응 조건 추론을 위한 다중 에이전트 시스템

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

초록

Support