BANG:通过生成式爆炸动力学分割3D资产
BANG: Dividing 3D Assets via Generative Exploded Dynamics
July 29, 2025
作者: Longwen Zhang, Qixuan Zhang, Haoran Jiang, Yinuo Bai, Wei Yang, Lan Xu, Jingyi Yu
cs.AI
摘要
三维创作历来是人类独有的强项,这源于我们运用眼睛、思维与双手对物体进行拆解与重组的能力。然而,现有的三维设计工具难以复现这一自然过程,往往需要深厚的艺术造诣与繁重的手工操作。本文提出BANG,一种创新的生成方法,它架起了三维生成与推理之间的桥梁,实现了对三维物体直观且灵活的部件级分解。BANG的核心在于“生成式爆炸动态”,它为输入的几何体创建一系列流畅的爆炸状态,逐步分离部件的同时保持其几何与语义的一致性。
BANG利用预训练的大规模潜在扩散模型,通过轻量级的爆炸视图适配器进行微调,从而精确控制分解过程。它还引入了时序注意力模块,确保时间维度上的平滑过渡与一致性。BANG通过空间提示(如边界框和表面区域)增强控制,让用户能够指定分解哪些部件及如何分解。这种交互可进一步扩展至多模态模型如GPT-4,实现从二维到三维的操控,为创作流程带来更直观与创新的体验。
BANG的能力不仅限于生成精细的部件级几何,还包括将部件与功能描述关联,促进组件感知的三维创作与制造流程。此外,BANG在3D打印领域也有应用,它生成可分离部件以便于打印与重新组装。本质上,BANG实现了从想象概念到详细三维资产的无缝转换,提供了一种与人类直觉共鸣的全新创作视角。
English
3D creation has always been a unique human strength, driven by our ability to
deconstruct and reassemble objects using our eyes, mind and hand. However,
current 3D design tools struggle to replicate this natural process, requiring
considerable artistic expertise and manual labor. This paper introduces BANG, a
novel generative approach that bridges 3D generation and reasoning, allowing
for intuitive and flexible part-level decomposition of 3D objects. At the heart
of BANG is "Generative Exploded Dynamics", which creates a smooth sequence of
exploded states for an input geometry, progressively separating parts while
preserving their geometric and semantic coherence.
BANG utilizes a pre-trained large-scale latent diffusion model, fine-tuned
for exploded dynamics with a lightweight exploded view adapter, allowing
precise control over the decomposition process. It also incorporates a temporal
attention module to ensure smooth transitions and consistency across time. BANG
enhances control with spatial prompts, such as bounding boxes and surface
regions, enabling users to specify which parts to decompose and how. This
interaction can be extended with multimodal models like GPT-4, enabling
2D-to-3D manipulations for more intuitive and creative workflows.
The capabilities of BANG extend to generating detailed part-level geometry,
associating parts with functional descriptions, and facilitating
component-aware 3D creation and manufacturing workflows. Additionally, BANG
offers applications in 3D printing, where separable parts are generated for
easy printing and reassembly. In essence, BANG enables seamless transformation
from imaginative concepts to detailed 3D assets, offering a new perspective on
creation that resonates with human intuition.