ImageDream：基于图像提示的多视角扩散用于3D生成

摘要

我们介绍了一种创新的图像提示多视角扩散模型“ImageDream”，用于3D物体生成。与现有的最先进的基于图像的方法相比，“ImageDream”以其生成质量更高的3D模型脱颖而出。我们的方法利用图像中物体的规范相机协调，提高了视觉几何精度。该模型在扩散模型内的每个块中根据输入图像设计了各种控制级别，全局控制塑造整体物体布局，而局部控制微调图像细节。通过使用标准提示列表进行广泛评估展示了“ImageDream”的有效性。欲了解更多信息，请访问我们的项目页面https://Image-Dream.github.io。

English

We introduce "ImageDream," an innovative image-prompt, multi-view diffusion model for 3D object generation. ImageDream stands out for its ability to produce 3D models of higher quality compared to existing state-of-the-art, image-conditioned methods. Our approach utilizes a canonical camera coordination for the objects in images, improving visual geometry accuracy. The model is designed with various levels of control at each block inside the diffusion model based on the input image, where global control shapes the overall object layout and local control fine-tunes the image details. The effectiveness of ImageDream is demonstrated through extensive evaluations using a standard prompt list. For more information, visit our project page at https://Image-Dream.github.io.

ImageDream：基于图像提示的多视角扩散用于3D生成

ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation

摘要

Support