ChatPaper.aiChatPaper

不再漂浮:从单张图像中重建物体-地面

Floating No More: Object-Ground Reconstruction from a Single Image

July 26, 2024
作者: Yunze Man, Yichen Sheng, Jianming Zhang, Liang-Yan Gui, Yu-Xiong Wang
cs.AI

摘要

最近关于从单个图像重建3D物体的研究主要集中在提高物体形状的准确性上。然而,这些技术通常无法准确捕捉物体、地面和摄像机之间的相互关系。因此,重建的物体通常在平面表面上呈现漂浮或倾斜的外观。这一局限严重影响了3D感知图像编辑应用,如阴影渲染和物体姿态操作。为了解决这个问题,我们引入了一项名为ORG(带地面的物体重建)的新任务,旨在重建3D物体几何形状以及地面表面。我们的方法使用两种紧凑的像素级表示来描述摄像机、物体和地面之间的关系。实验证明,所提出的ORG模型能够有效地在未见数据上重建物体-地面几何形状,与传统的单图像3D重建技术相比,显著提高了阴影生成和姿态操作的质量。
English
Recent advancements in 3D object reconstruction from single images have primarily focused on improving the accuracy of object shapes. Yet, these techniques often fail to accurately capture the inter-relation between the object, ground, and camera. As a result, the reconstructed objects often appear floating or tilted when placed on flat surfaces. This limitation significantly affects 3D-aware image editing applications like shadow rendering and object pose manipulation. To address this issue, we introduce ORG (Object Reconstruction with Ground), a novel task aimed at reconstructing 3D object geometry in conjunction with the ground surface. Our method uses two compact pixel-level representations to depict the relationship between camera, object, and ground. Experiments show that the proposed ORG model can effectively reconstruct object-ground geometry on unseen data, significantly enhancing the quality of shadow generation and pose manipulation compared to conventional single-image 3D reconstruction techniques.

Summary

AI-Generated Summary

PDF193November 28, 2024