ChatPaper.aiChatPaper

BANG:基于生成式爆炸动力学的三维资产分割

BANG: Dividing 3D Assets via Generative Exploded Dynamics

July 29, 2025
作者: Longwen Zhang, Qixuan Zhang, Haoran Jiang, Yinuo Bai, Wei Yang, Lan Xu, Jingyi Yu
cs.AI

摘要

三維創作一直是人類獨特的強項,這源於我們能夠運用眼睛、心智和手來解構並重組物體。然而,當前的三維設計工具難以複製這一自然過程,需要大量的藝術專業知識和手工勞動。本文介紹了BANG,這是一種新穎的生成方法,它橋接了三維生成與推理,允許對三維物體進行直觀且靈活的部件級分解。BANG的核心是“生成式爆炸動態”,它為輸入的幾何體創建了一系列平滑的爆炸狀態,逐步分離部件,同時保持其幾何和語義的連貫性。 BANG利用了一個預訓練的大規模潛在擴散模型,並通過輕量級的爆炸視圖適配器進行微調,以實現對分解過程的精確控制。它還整合了一個時間注意力模塊,確保了平滑的過渡和時間上的一致性。BANG通過空間提示(如邊界框和表面區域)增強了控制,使用戶能夠指定要分解的部件及其方式。這種互動可以通過多模態模型(如GPT-4)進行擴展,實現從二維到三維的操作,從而提供更直觀和創意的工作流程。 BANG的能力還包括生成詳細的部件級幾何體、將部件與功能描述關聯起來,以及促進組件感知的三維創作和製造工作流程。此外,BANG在3D打印中也有應用,它生成可分離的部件以便於打印和重新組裝。本質上,BANG實現了從想像概念到詳細三維資產的無縫轉換,提供了一種與人類直覺共鳴的創作新視角。
English
3D creation has always been a unique human strength, driven by our ability to deconstruct and reassemble objects using our eyes, mind and hand. However, current 3D design tools struggle to replicate this natural process, requiring considerable artistic expertise and manual labor. This paper introduces BANG, a novel generative approach that bridges 3D generation and reasoning, allowing for intuitive and flexible part-level decomposition of 3D objects. At the heart of BANG is "Generative Exploded Dynamics", which creates a smooth sequence of exploded states for an input geometry, progressively separating parts while preserving their geometric and semantic coherence. BANG utilizes a pre-trained large-scale latent diffusion model, fine-tuned for exploded dynamics with a lightweight exploded view adapter, allowing precise control over the decomposition process. It also incorporates a temporal attention module to ensure smooth transitions and consistency across time. BANG enhances control with spatial prompts, such as bounding boxes and surface regions, enabling users to specify which parts to decompose and how. This interaction can be extended with multimodal models like GPT-4, enabling 2D-to-3D manipulations for more intuitive and creative workflows. The capabilities of BANG extend to generating detailed part-level geometry, associating parts with functional descriptions, and facilitating component-aware 3D creation and manufacturing workflows. Additionally, BANG offers applications in 3D printing, where separable parts are generated for easy printing and reassembly. In essence, BANG enables seamless transformation from imaginative concepts to detailed 3D assets, offering a new perspective on creation that resonates with human intuition.
PDF523July 31, 2025