Pro3D-Editor: A Progressive-Views Perspective for Consistent and Precise 3D Editing

Abstract

Text-guided 3D editing aims to precisely edit semantically relevant local 3D regions, and it has significant potential for practical applications ranging from 3D games to film production. Existing methods typically follow a view-indiscriminate paradigm: they edit 2D views without distinguishing between them and project the results back into 3D space. However, they overlook the interdependencies across views, which leads to inconsistent multi-view editing. In this study, we argue that consistent 3D editing can be achieved through a progressive-views paradigm, which propagates editing semantics from the editing-salient view to the other, editing-sparse views. Specifically, we propose Pro3D-Editor, a novel framework that mainly consists of three components: Primary-view Sampler, Key-view Render, and Full-view Refiner. Primary-view Sampler dynamically samples and edits the most editing-salient view as the primary view. Key-view Render accurately propagates editing semantics from the primary view to the other key views through its Mixture-of-View-Experts Low-Rank Adaption (MoVE-LoRA). Full-view Refiner edits and refines the 3D object based on the edited multi-views. Extensive experiments demonstrate that our method outperforms existing methods in editing accuracy and spatial consistency.
