不再漂浮：從單張圖像重建物體-地面

摘要

最近在從單張圖像中重建3D物體方面的進展主要集中在提高物體形狀的準確性上。然而，這些技術通常無法準確捕捉物體、地面和相機之間的相互關係。因此，重建的物體在放置在平面表面時常常看起來漂浮或傾斜。這一限制顯著影響了3D感知圖像編輯應用，如陰影渲染和物體姿態操作。為了解決這個問題，我們引入了一個名為ORG（Object Reconstruction with Ground）的新任務，旨在重建3D物體幾何形狀以及地面表面。我們的方法使用兩種緊湊的像素級表示來描述相機、物體和地面之間的關係。實驗表明，所提出的ORG模型能夠有效地在未見數據上重建物體-地面幾何形狀，與傳統的單張圖像3D重建技術相比，顯著提高了陰影生成和姿態操作的質量。

English

Recent advancements in 3D object reconstruction from single images have primarily focused on improving the accuracy of object shapes. Yet, these techniques often fail to accurately capture the inter-relation between the object, ground, and camera. As a result, the reconstructed objects often appear floating or tilted when placed on flat surfaces. This limitation significantly affects 3D-aware image editing applications like shadow rendering and object pose manipulation. To address this issue, we introduce ORG (Object Reconstruction with Ground), a novel task aimed at reconstructing 3D object geometry in conjunction with the ground surface. Our method uses two compact pixel-level representations to depict the relationship between camera, object, and ground. Experiments show that the proposed ORG model can effectively reconstruct object-ground geometry on unseen data, significantly enhancing the quality of shadow generation and pose manipulation compared to conventional single-image 3D reconstruction techniques.

不再漂浮：從單張圖像重建物體-地面

Floating No More: Object-Ground Reconstruction from a Single Image

摘要

Support