影像作為IMU：從單張運動模糊影像中估計相機運動

摘要

在許多機器人與虛擬/擴增實境應用中，快速的相機運動會導致高度運動模糊，使得現有的相機姿態估計方法失效。本研究提出了一種新穎的框架，將運動模糊視為運動估計的豐富線索，而非將其視為不希望的干擾。我們的方法通過從單張運動模糊圖像直接預測密集的運動流場和單目深度圖來實現。隨後，在小運動假設下，通過求解線性最小二乘問題來恢復瞬時相機速度。本質上，我們的方法產生了一種類似IMU的測量值，能夠穩健地捕捉快速且劇烈的相機運動。為了訓練我們的模型，我們構建了一個大規模數據集，其中包含基於ScanNet++v2生成的逼真合成運動模糊，並通過使用我們完全可微分的管道在真實數據上進行端到端訓練來進一步精煉模型。在真實世界基準上的廣泛評估表明，我們的方法在角速度和平移速度估計上達到了最先進的水平，超越了如MASt3R和COLMAP等現有方法。

English

In many robotics and VR/AR applications, fast camera motions cause a high level of motion blur, causing existing camera pose estimation methods to fail. In this work, we propose a novel framework that leverages motion blur as a rich cue for motion estimation rather than treating it as an unwanted artifact. Our approach works by predicting a dense motion flow field and a monocular depth map directly from a single motion-blurred image. We then recover the instantaneous camera velocity by solving a linear least squares problem under the small motion assumption. In essence, our method produces an IMU-like measurement that robustly captures fast and aggressive camera movements. To train our model, we construct a large-scale dataset with realistic synthetic motion blur derived from ScanNet++v2 and further refine our model by training end-to-end on real data using our fully differentiable pipeline. Extensive evaluations on real-world benchmarks demonstrate that our method achieves state-of-the-art angular and translational velocity estimates, outperforming current methods like MASt3R and COLMAP.

影像作為IMU：從單張運動模糊影像中估計相機運動

Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image

摘要

Support