不再漂浮:從單張圖像重建物體-地面
Floating No More: Object-Ground Reconstruction from a Single Image
July 26, 2024
作者: Yunze Man, Yichen Sheng, Jianming Zhang, Liang-Yan Gui, Yu-Xiong Wang
cs.AI
摘要
最近在從單張圖像中重建3D物體方面的進展主要集中在提高物體形狀的準確性上。然而,這些技術通常無法準確捕捉物體、地面和相機之間的相互關係。因此,重建的物體在放置在平面表面時常常看起來漂浮或傾斜。這一限制顯著影響了3D感知圖像編輯應用,如陰影渲染和物體姿態操作。為了解決這個問題,我們引入了一個名為ORG(Object Reconstruction with Ground)的新任務,旨在重建3D物體幾何形狀以及地面表面。我們的方法使用兩種緊湊的像素級表示來描述相機、物體和地面之間的關係。實驗表明,所提出的ORG模型能夠有效地在未見數據上重建物體-地面幾何形狀,與傳統的單張圖像3D重建技術相比,顯著提高了陰影生成和姿態操作的質量。
English
Recent advancements in 3D object reconstruction from single images have
primarily focused on improving the accuracy of object shapes. Yet, these
techniques often fail to accurately capture the inter-relation between the
object, ground, and camera. As a result, the reconstructed objects often appear
floating or tilted when placed on flat surfaces. This limitation significantly
affects 3D-aware image editing applications like shadow rendering and object
pose manipulation. To address this issue, we introduce ORG (Object
Reconstruction with Ground), a novel task aimed at reconstructing 3D object
geometry in conjunction with the ground surface. Our method uses two compact
pixel-level representations to depict the relationship between camera, object,
and ground. Experiments show that the proposed ORG model can effectively
reconstruct object-ground geometry on unseen data, significantly enhancing the
quality of shadow generation and pose manipulation compared to conventional
single-image 3D reconstruction techniques.Summary
AI-Generated Summary