ChatPaper.aiChatPaper

CoRe3D:以协同推理为基石构建三维智能

CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence

December 14, 2025
作者: Tianjiao Yu, Xinzhuo Li, Yifan Shen, Yuanzhe Liu, Ismini Lourentzou
cs.AI

摘要

大型多模态模型的最新进展表明,显式推理机制对提升模型可靠性、可解释性及跨模态对齐具有关键作用。尽管这类以推理为核心的方法已在语言与视觉任务中被证明有效,但其向三维领域的拓展仍显不足。CoRe3D提出了一种统一的三维理解与生成推理框架,该框架能协同处理语义与空间抽象信息,使从语言推断出的高层意图直接指导低层三维内容的生成。该设计的核心在于一种空间锚定的推理表征,它将三维潜在空间分解为局部区域,使模型能够以组合式、流程化的方式对几何结构进行推理。通过将语义链式推理与结构化空间推理紧密耦合,CoRe3D生成的三维输出既保持了强烈的局部一致性,又与语言描述实现精准对齐。
English
Recent advances in large multimodal models suggest that explicit reasoning mechanisms play a critical role in improving model reliability, interpretability, and cross-modal alignment. While such reasoning-centric approaches have been proven effective in language and vision tasks, their extension to 3D remains underdeveloped. CoRe3D introduces a unified 3D understanding and generation reasoning framework that jointly operates over semantic and spatial abstractions, enabling high-level intent inferred from language to directly guide low-level 3D content formation. Central to this design is a spatially grounded reasoning representation that decomposes 3D latent space into localized regions, allowing the model to reason over geometry in a compositional and procedural manner. By tightly coupling semantic chain-of-thought inference with structured spatial reasoning, CoRe3D produces 3D outputs that exhibit strong local consistency and faithful alignment with linguistic descriptions.
PDF12December 17, 2025