ImageDream：圖像提示多視圖擴散用於3D生成

摘要

我們介紹了一種創新的圖像提示多視圖擴散模型"ImageDream"，用於3D物體生成。ImageDream以其能夠生成比現有最先進的圖像條件方法質量更高的3D模型而脫穎而出。我們的方法利用圖像中物體的標準攝像機協調，提高了視覺幾何準確性。該模型設計了各種級別的控制，在擴散模型內的每個塊基於輸入圖像，其中全局控制塑造了整體物體佈局，而局部控制微調了圖像細節。通過使用標準提示清單進行廣泛評估，證明了ImageDream的有效性。欲獲得更多信息，請訪問我們的專案頁面：https://Image-Dream.github.io。

English

We introduce "ImageDream," an innovative image-prompt, multi-view diffusion model for 3D object generation. ImageDream stands out for its ability to produce 3D models of higher quality compared to existing state-of-the-art, image-conditioned methods. Our approach utilizes a canonical camera coordination for the objects in images, improving visual geometry accuracy. The model is designed with various levels of control at each block inside the diffusion model based on the input image, where global control shapes the overall object layout and local control fine-tunes the image details. The effectiveness of ImageDream is demonstrated through extensive evaluations using a standard prompt list. For more information, visit our project page at https://Image-Dream.github.io.

ImageDream：圖像提示多視圖擴散用於3D生成

ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation

摘要

Support