SciDER: 科学データ中心エンドツーエンド研究者

要旨

大規模言語モデルによる自動化された科学的発見は、研究ライフサイクルを構想段階から実験段階へと変革しつつあるが、既存のエージェントは科学実験で収集された生データを自律的に処理するのに苦戦している。本論文では、研究ライフサイクルを自動化する、データ中心のエンドツーエンドシステムであるSciDERを提案する。従来のフレームワークとは異なり、当システムの専門化されたエージェント群は、生の科学データを協調的に解析・分析し、特定のデータ特性に基づいた仮説と実験計画を生成し、対応するコードを記述・実行する。3つのベンチマークによる評価では、SciDERが専門的なデータ駆動型科学発見において優れた性能を発揮し、自己進化型メモリと批評家主導のフィードバックループを通じて、汎用エージェントや最先端モデルを凌駕することが示された。モジュール型Pythonパッケージとして配布されており、軽量なWebインターフェースを備えた使いやすいPyPIパッケージも提供する。これにより、自律的でデータ駆動型の研究を加速し、すべての研究者と開発者が利用できることを目指している。

English

Automated scientific discovery with large language models is transforming the research lifecycle from ideation to experimentation, yet existing agents struggle to autonomously process raw data collected from scientific experiments. We introduce SciDER, a data-centric end-to-end system that automates the research lifecycle. Unlike traditional frameworks, our specialized agents collaboratively parse and analyze raw scientific data, generate hypotheses and experimental designs grounded in specific data characteristics, and write and execute corresponding code. Evaluation on three benchmarks shows SciDER excels in specialized data-driven scientific discovery and outperforms general-purpose agents and state-of-the-art models through its self-evolving memory and critic-led feedback loop. Distributed as a modular Python package, we also provide easy-to-use PyPI packages with a lightweight web interface to accelerate autonomous, data-driven research and aim to be accessible to all researchers and developers.

SciDER: 科学データ中心エンドツーエンド研究者

SciDER: Scientific Data-centric End-to-end Researcher

要旨

Support