ChatPaper.aiChatPaper

Spacer:迈向工程化的科学灵感

Spacer: Towards Engineered Scientific Inspiration

August 25, 2025
作者: Minhyeong Lee, Suyoung Hwang, Seunghyun Moon, Geonho Nah, Donghyun Koh, Youngjun Cho, Johyun Park, Hojin Yoo, Jiho Park, Haneul Choi, Sungbin Moon, Taehoon Hwang, Seungwon Kim, Jaeyeong Kim, Seongjun Kim, Juneau Jung
cs.AI

摘要

近期大语言模型(LLMs)的进展,已将自动化科学研究推向了通往人工超级智能的前沿阵地。然而,这些系统要么局限于特定任务,要么受限于LLMs有限的创造力。我们提出了Spacer,一个无需外部干预即可生成创意且基于事实的科学发现系统。Spacer通过“刻意去语境化”实现这一目标,该方法将信息拆解为原子单元——关键词,并从这些关键词间未被探索的联系中汲取创意。Spacer由两部分组成:(i) Nuri,一个构建关键词集的灵感引擎,以及(ii) 将关键词集精炼为详尽科学陈述的“显化管道”。Nuri从包含18万篇生物学领域学术文献构建的关键词图中提取新颖且高潜力的关键词集。“显化管道”则寻找关键词间的联系,分析其逻辑结构,验证其合理性,并最终起草原创的科学概念。实验表明,Nuri的评估指标以0.737的AUROC分数准确分类了高影响力文献。我们的“显化管道”也成功仅凭关键词集重建了最新顶级期刊文章的核心概念。基于LLM的评分系统估计,这一重建在超过85%的情况下是可靠的。最后,我们的嵌入空间分析显示,与当前最先进的LLMs相比,Spacer的输出与领先出版物显著更为相似。
English
Recent advances in LLMs have made automated scientific research the next frontline in the path to artificial superintelligence. However, these systems are bound either to tasks of narrow scope or the limited creative capabilities of LLMs. We propose Spacer, a scientific discovery system that develops creative and factually grounded concepts without external intervention. Spacer attempts to achieve this via 'deliberate decontextualization,' an approach that disassembles information into atomic units - keywords - and draws creativity from unexplored connections between them. Spacer consists of (i) Nuri, an inspiration engine that builds keyword sets, and (ii) the Manifesting Pipeline that refines these sets into elaborate scientific statements. Nuri extracts novel, high-potential keyword sets from a keyword graph built with 180,000 academic publications in biological fields. The Manifesting Pipeline finds links between keywords, analyzes their logical structure, validates their plausibility, and ultimately drafts original scientific concepts. According to our experiments, the evaluation metric of Nuri accurately classifies high-impact publications with an AUROC score of 0.737. Our Manifesting Pipeline also successfully reconstructs core concepts from the latest top-journal articles solely from their keyword sets. An LLM-based scoring system estimates that this reconstruction was sound for over 85% of the cases. Finally, our embedding space analysis shows that outputs from Spacer are significantly more similar to leading publications compared with those from SOTA LLMs.
PDF201August 27, 2025