자율 주행을 위한 3D 점유율 예측을 위한 확산 기반 생성 모델

초록

시각적 입력으로부터 3D 점유 그리드를 정확하게 예측하는 것은 자율 주행에 있어 매우 중요하지만, 현재의 판별적 방법들은 노이즈가 있는 데이터, 불완전한 관측, 그리고 3D 장면에 내재된 복잡한 구조를 다루는 데 어려움을 겪고 있습니다. 본 연구에서는 3D 점유 예측을 확산 모델을 사용한 생성 모델링 작업으로 재구성합니다. 이 모델은 기본 데이터 분포를 학습하고 3D 장면 사전 정보를 통합하여 예측의 일관성과 노이즈 강인성을 향상시키며, 3D 공간 구조의 복잡성을 더 잘 처리합니다. 우리의 광범위한 실험 결과, 확산 기반 생성 모델은 최첨단 판별적 접근법을 능가하며, 특히 가려진 영역이나 시야가 낮은 영역에서 더 현실적이고 정확한 점유 예측을 제공합니다. 또한, 개선된 예측 결과는 하위 계획 작업에 상당한 이점을 제공하여, 실제 자율 주행 애플리케이션에서 우리 방법의 실용적 장점을 강조합니다.

English

Accurately predicting 3D occupancy grids from visual inputs is critical for autonomous driving, but current discriminative methods struggle with noisy data, incomplete observations, and the complex structures inherent in 3D scenes. In this work, we reframe 3D occupancy prediction as a generative modeling task using diffusion models, which learn the underlying data distribution and incorporate 3D scene priors. This approach enhances prediction consistency, noise robustness, and better handles the intricacies of 3D spatial structures. Our extensive experiments show that diffusion-based generative models outperform state-of-the-art discriminative approaches, delivering more realistic and accurate occupancy predictions, especially in occluded or low-visibility regions. Moreover, the improved predictions significantly benefit downstream planning tasks, highlighting the practical advantages of our method for real-world autonomous driving applications.

자율 주행을 위한 3D 점유율 예측을 위한 확산 기반 생성 모델

Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

초록

Support