Pro3D-Editor: A Progressive-Views Perspective for Consistent and Precise 3D Editing
May 31, 2025
Authors: Yang Zheng, Mengqi Huang, Nan Chen, Zhendong Mao
cs.AI
Abstract
Text-guided 3D editing aims to precisely edit semantically relevant local 3D
regions, which has significant potential for various practical applications
ranging from 3D games to film production. Existing methods typically follow a
view-indiscriminate paradigm: editing 2D views indiscriminately and projecting
them back into 3D space. However, they overlook the interdependencies between
different views, resulting in inconsistent multi-view editing. In this study,
we argue that ideal consistent 3D editing can be achieved through a
progressive-views paradigm, which propagates editing semantics from
the editing-salient view to other editing-sparse views. Specifically, we
propose Pro3D-Editor, a novel framework that consists mainly of a
Primary-view Sampler, a Key-view Render, and a Full-view Refiner. The Primary-view
Sampler dynamically samples and edits the most editing-salient view as the
primary view. The Key-view Render accurately propagates editing semantics from the
primary view to other key views through its Mixture-of-View-Experts Low-Rank
Adaptation (MoVE-LoRA). The Full-view Refiner edits and refines the 3D object based
on the edited multi-views. Extensive experiments demonstrate that our method
outperforms existing methods in editing accuracy and spatial consistency.
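
The abstract does not say how MoVE-LoRA is built internally. As a rough illustration of the general idea its name suggests, the sketch below shows a generic mixture of low-rank adapters applied to one frozen linear layer, where per-view gate scores decide how much each expert contributes. The class name `MixtureOfLoRAExperts`, the softmax gating, and the initialization are all assumptions made for illustration, not the authors' implementation.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class MixtureOfLoRAExperts:
    """Hypothetical layer: frozen weight plus gated low-rank expert updates.

    Computes y = W x + sum_i g_i * B_i (A_i x), where g = softmax(gate_logits).
    This is a generic mixture-of-LoRA-experts form, assumed for illustration.
    """

    def __init__(self, weight, num_experts, rank, seed=0):
        rng = random.Random(seed)
        self.weight = weight  # frozen base weight, shape d_out x d_in
        d_out, d_in = len(weight), len(weight[0])
        # Down-projections A_i (rank x d_in): small random values.
        self.down = [
            [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(rank)]
            for _ in range(num_experts)
        ]
        # Up-projections B_i (d_out x rank): zeros, so the layer starts out
        # identical to the frozen base layer (the usual LoRA initialization).
        self.up = [
            [[0.0] * rank for _ in range(d_out)] for _ in range(num_experts)
        ]

    def forward(self, x, gate_logits):
        """Apply the base layer and add the gate-weighted expert updates."""
        gates = softmax(gate_logits)
        out = matvec(self.weight, x)
        for gate, down, up in zip(gates, self.down, self.up):
            delta = matvec(up, matvec(down, x))
            out = [o + gate * d for o, d in zip(out, delta)]
        return out


if __name__ == "__main__":
    layer = MixtureOfLoRAExperts([[1.0, 0.0], [0.0, 1.0]], num_experts=2, rank=1)
    # Gate logits would come from the target view in a real system (assumed).
    print(layer.forward([2.0, 3.0], gate_logits=[0.0, 0.0]))
```

In a view-conditioned setting, the gate logits would presumably depend on which key view is being rendered, so different views could draw on different experts while sharing the frozen base weights.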