不再漂浮：从单张图像中重建物体-地面

摘要

最近关于从单个图像重建3D物体的研究主要集中在提高物体形状的准确性上。然而，这些技术通常无法准确捕捉物体、地面和摄像机之间的相互关系。因此，重建的物体通常在平面表面上呈现漂浮或倾斜的外观。这一局限严重影响了3D感知图像编辑应用，如阴影渲染和物体姿态操作。为了解决这个问题，我们引入了一项名为ORG（带地面的物体重建）的新任务，旨在重建3D物体几何形状以及地面表面。我们的方法使用两种紧凑的像素级表示来描述摄像机、物体和地面之间的关系。实验证明，所提出的ORG模型能够有效地在未见数据上重建物体-地面几何形状，与传统的单图像3D重建技术相比，显著提高了阴影生成和姿态操作的质量。

English

Recent advancements in 3D object reconstruction from single images have primarily focused on improving the accuracy of object shapes. Yet, these techniques often fail to accurately capture the inter-relation between the object, ground, and camera. As a result, the reconstructed objects often appear floating or tilted when placed on flat surfaces. This limitation significantly affects 3D-aware image editing applications like shadow rendering and object pose manipulation. To address this issue, we introduce ORG (Object Reconstruction with Ground), a novel task aimed at reconstructing 3D object geometry in conjunction with the ground surface. Our method uses two compact pixel-level representations to depict the relationship between camera, object, and ground. Experiments show that the proposed ORG model can effectively reconstruct object-ground geometry on unseen data, significantly enhancing the quality of shadow generation and pose manipulation compared to conventional single-image 3D reconstruction techniques.

不再漂浮：从单张图像中重建物体-地面

Floating No More: Object-Ground Reconstruction from a Single Image

摘要

Support