提升乒乓球技术:三维轨迹与旋转估测的稳健实际应用
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation
November 25, 2025
作者: Daniel Kienzle, Katja Ludwig, Julian Lorenz, Shin'ichi Satoh, Rainer Lienhart
cs.AI
摘要
从单目视频中精确获取乒乓球的三维运动轨迹是一项具有挑战性的任务,因为基于合成数据训练的现有方法难以泛化到现实世界中存在噪声及不完美球体与球台检测的场景。这主要源于真实视频数据本身缺乏三维真实轨迹和旋转标注。为解决这一问题,我们提出了一种新颖的两阶段流程,将任务划分为前端感知任务与后端二维至三维提升任务。这种分离策略使我们能够利用新构建的TTHQ数据集中的海量二维标注训练前端组件,而后端提升网络则仅在符合物理规律的合成数据上进行训练。我们特别对提升模型进行重构,使其对漏检、帧率波动等常见现实干扰具有鲁棒性。通过整合球体检测器与球台关键点检测器,本方法将概念验证性的提升技术转化为实用、鲁棒且高性能的端到端三维乒乓球轨迹与旋转分析系统。
English
Obtaining the precise 3D motion of a table tennis ball from standard monocular videos is a challenging problem, as existing methods trained on synthetic data struggle to generalize to the noisy, imperfect ball and table detections of the real world. This is primarily due to the inherent lack of 3D ground truth trajectories and spin annotations for real-world video. To overcome this, we propose a novel two-stage pipeline that divides the problem into a front-end perception task and a back-end 2D-to-3D uplifting task. This separation allows us to train the front-end components with abundant 2D supervision from our newly created TTHQ dataset, while the back-end uplifting network is trained exclusively on physically-correct synthetic data. We specifically re-engineer the uplifting model to be robust to common real-world artifacts, such as missing detections and varying frame rates. By integrating a ball detector and a table keypoint detector, our approach transforms a proof-of-concept uplifting method into a practical, robust, and high-performing end-to-end application for 3D table tennis trajectory and spin analysis.