学習可能なSMPLify：最適化不要の人間姿勢逆運動学のためのニューラルソリューション

要旨

3D人間の姿勢と形状推定において、SMPLifyは反復最適化を通じて逆運動学（IK）を解決する堅牢なベースラインとして残っています。しかし、その高い計算コストが実用性を制限しています。最近のさまざまな分野での進展により、反復最適化をデータ駆動型のニューラルネットワークに置き換えることで、精度を犠牲にすることなく大幅な実行時間の改善が達成できることが示されています。このトレンドに触発され、我々はLearnable SMPLifyを提案します。これは、SMPLifyの反復フィッティングプロセスを単一パスの回帰モデルに置き換えるニューラルフレームワークです。我々のフレームワークの設計は、ニューラルIKにおける2つの核心的な課題、すなわちデータ構築と汎化能力に焦点を当てています。効果的な学習を可能にするために、シーケンシャルフレームから初期化-ターゲットペアを構築する時間的サンプリング戦略を提案します。多様な動きや未見の姿勢に対する汎化能力を向上させるために、人間中心の正規化スキームと残差学習を提案し、解空間を狭めます。Learnable SMPLifyは、シーケンシャル推論と既存の画像ベースの推定器を洗練するためのプラグインポストプロセッシングの両方をサポートします。広範な実験により、我々の方法が実用的でシンプルなベースラインとして確立されることが示されています：SMPLifyと比較して約200倍の高速な実行時間を達成し、未見の3DPWとRICHに対して良好に汎化し、LucidActionでプラグインツールとして使用される際にモデル非依存の方法で動作します。コードはhttps://github.com/Charrrrrlie/Learnable-SMPLifyで公開されています。

English

In 3D human pose and shape estimation, SMPLify remains a robust baseline that solves inverse kinematics (IK) through iterative optimization. However, its high computational cost limits its practicality. Recent advances across domains have shown that replacing iterative optimization with data-driven neural networks can achieve significant runtime improvements without sacrificing accuracy. Motivated by this trend, we propose Learnable SMPLify, a neural framework that replaces the iterative fitting process in SMPLify with a single-pass regression model. The design of our framework targets two core challenges in neural IK: data construction and generalization. To enable effective training, we propose a temporal sampling strategy that constructs initialization-target pairs from sequential frames. To improve generalization across diverse motions and unseen poses, we propose a human-centric normalization scheme and residual learning to narrow the solution space. Learnable SMPLify supports both sequential inference and plug-in post-processing to refine existing image-based estimators. Extensive experiments demonstrate that our method establishes itself as a practical and simple baseline: it achieves nearly 200x faster runtime compared to SMPLify, generalizes well to unseen 3DPW and RICH, and operates in a model-agnostic manner when used as a plug-in tool on LucidAction. The code is available at https://github.com/Charrrrrlie/Learnable-SMPLify.

学習可能なSMPLify：最適化不要の人間姿勢逆運動学のためのニューラルソリューション

Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics

要旨

Support