可驱动姿态生成:面向三维人形角色动画的前馈式隐空间姿态建模
Make-It-Poseable: Feed-forward Latent Posing Model for 3D Humanoid Character Animation
December 18, 2025
作者: Zhiyang Guo, Ori Zhang, Jax Xiang, Alan Zhao, Wengang Zhou, Houqiang Li
cs.AI
摘要
三维角色姿态设定是计算机图形学与视觉领域的基础任务。然而,现有方法如自动骨骼绑定和姿态条件生成常面临蒙皮权重预测不准、拓扑结构缺陷及姿态贴合度差等挑战,制约了其鲁棒性与泛化能力。为突破这些局限,我们提出Make-It-Poseable——一种将角色姿态设定重构为隐空间变换问题的新型前馈框架。与传统流程中变形网格顶点不同,本方法通过直接操控隐式表征实现新姿态下的角色重建。其核心在于基于骨骼运动操控形状标记的隐式姿态变换器,辅以密集姿态表征实现精准控制。为确保高保真几何并适应拓扑变化,我们还引入了隐空间监督策略与自适应补全模块。本方法在姿态质量上展现出卓越性能,并可自然扩展到部件替换与精细化等三维编辑应用。
English
Posing 3D characters is a fundamental task in computer graphics and vision. However, existing methods like auto-rigging and pose-conditioned generation often struggle with challenges such as inaccurate skinning weight prediction, topological imperfections, and poor pose conformance, limiting their robustness and generalizability. To overcome these limitations, we introduce Make-It-Poseable, a novel feed-forward framework that reformulates character posing as a latent-space transformation problem. Instead of deforming mesh vertices as in traditional pipelines, our method reconstructs the character in new poses by directly manipulating its latent representation. At the core of our method is a latent posing transformer that manipulates shape tokens based on skeletal motion. This process is facilitated by a dense pose representation for precise control. To ensure high-fidelity geometry and accommodate topological changes, we also introduce a latent-space supervision strategy and an adaptive completion module. Our method demonstrates superior performance in posing quality. It also naturally extends to 3D editing applications like part replacement and refinement.