ChatPaper.aiChatPaper

不再漂浮:從單張圖像重建物體-地面

Floating No More: Object-Ground Reconstruction from a Single Image

July 26, 2024
作者: Yunze Man, Yichen Sheng, Jianming Zhang, Liang-Yan Gui, Yu-Xiong Wang
cs.AI

摘要

最近在從單張圖像中重建3D物體方面的進展主要集中在提高物體形狀的準確性上。然而,這些技術通常無法準確捕捉物體、地面和相機之間的相互關係。因此,重建的物體在放置在平面表面時常常看起來漂浮或傾斜。這一限制顯著影響了3D感知圖像編輯應用,如陰影渲染和物體姿態操作。為了解決這個問題,我們引入了一個名為ORG(Object Reconstruction with Ground)的新任務,旨在重建3D物體幾何形狀以及地面表面。我們的方法使用兩種緊湊的像素級表示來描述相機、物體和地面之間的關係。實驗表明,所提出的ORG模型能夠有效地在未見數據上重建物體-地面幾何形狀,與傳統的單張圖像3D重建技術相比,顯著提高了陰影生成和姿態操作的質量。
English
Recent advancements in 3D object reconstruction from single images have primarily focused on improving the accuracy of object shapes. Yet, these techniques often fail to accurately capture the inter-relation between the object, ground, and camera. As a result, the reconstructed objects often appear floating or tilted when placed on flat surfaces. This limitation significantly affects 3D-aware image editing applications like shadow rendering and object pose manipulation. To address this issue, we introduce ORG (Object Reconstruction with Ground), a novel task aimed at reconstructing 3D object geometry in conjunction with the ground surface. Our method uses two compact pixel-level representations to depict the relationship between camera, object, and ground. Experiments show that the proposed ORG model can effectively reconstruct object-ground geometry on unseen data, significantly enhancing the quality of shadow generation and pose manipulation compared to conventional single-image 3D reconstruction techniques.

Summary

AI-Generated Summary

PDF193November 28, 2024