发丝守护者:深度、立体与新视角中的软边界重建技术
Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views
January 6, 2026
作者: Xiang Zhang, Yang Zhang, Lukas Mehl, Markus Gross, Christopher Schroers
cs.AI
摘要
软边界(如发丝等细微结构)在自然图像和计算机生成图像中普遍存在,但由于前景与背景线索的模糊混合,其三维视觉重建仍具挑战性。本文提出发丝守护者框架,该框架专为恢复三维视觉任务中的细粒度软边界细节而设计。具体而言,我们首创基于图像抠图数据集训练的数据处理流程,并设计深度修复网络自动识别软边界区域。通过门控残差模块,该网络能在保持全局深度质量的同时精准优化软边界周围的深度信息,实现与前沿深度模型的即插即用式集成。在视图合成方面,我们采用基于深度的前向变形以保留高保真纹理,继而通过生成式场景绘制器填充遮挡移除区域并消除软边界内的冗余背景伪影。最终,色彩融合模块自适应地结合变形与修复结果,生成具有几何一致性和细粒度细节的新视角图像。大量实验表明,HairGuard在单目深度估计、立体图像/视频转换及新视角合成任务中均实现最先进性能,尤其在软边界区域取得显著提升。
English
Soft boundaries, like thin hairs, are commonly observed in natural and computer-generated imagery, but they remain challenging for 3D vision due to the ambiguous mixing of foreground and background cues. This paper introduces Guardians of the Hair (HairGuard), a framework designed to recover fine-grained soft boundary details in 3D vision tasks. Specifically, we first propose a novel data curation pipeline that leverages image matting datasets for training and design a depth fixer network to automatically identify soft boundary regions. With a gated residual module, the depth fixer refines depth precisely around soft boundaries while maintaining global depth quality, allowing plug-and-play integration with state-of-the-art depth models. For view synthesis, we perform depth-based forward warping to retain high-fidelity textures, followed by a generative scene painter that fills disoccluded regions and eliminates redundant background artifacts within soft boundaries. Finally, a color fuser adaptively combines warped and inpainted results to produce novel views with consistent geometry and fine-grained details. Extensive experiments demonstrate that HairGuard achieves state-of-the-art performance across monocular depth estimation, stereo image/video conversion, and novel view synthesis, with significant improvements in soft boundary regions.