ChatPaper.aiChatPaper

MotionCtrl:一种统一灵活的视频生成运动控制器

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

December 6, 2023
作者: Zhouxia Wang, Ziyang Yuan, Xintao Wang, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan
cs.AI

摘要

视频中的运动主要包括由摄像机移动引起的摄像机运动和由物体移动导致的物体运动。准确控制摄像机和物体运动对于视频生成至关重要。然而,现有研究要么主要关注一种类型的运动,要么未明确区分这两种类型,从而限制了它们的控制能力和多样性。因此,本文提出了MotionCtrl,这是一个统一灵活的视频生成运动控制器,旨在有效独立地控制摄像机和物体运动。MotionCtrl的架构和训练策略经过精心设计,考虑了摄像机运动、物体运动和不完美训练数据的固有属性。与先前的方法相比,MotionCtrl具有三个主要优势:1)它有效独立地控制摄像机运动和物体运动,实现更精细的运动控制,促进两种类型运动的灵活多样组合。2)其运动条件由摄像机姿势和轨迹确定,这些条件与外观无关,最小程度地影响生成视频中物体的外观或形状。3)它是一个相对通用的模型,一旦训练完成,就可以适应各种摄像机姿势和轨迹。进行了大量的定性和定量实验,以展示MotionCtrl相对于现有方法的优越性。
English
Motions in a video primarily consist of camera motion, induced by camera movement, and object motion, resulting from object movement. Accurate control of both camera and object motion is essential for video generation. However, existing works either mainly focus on one type of motion or do not clearly distinguish between the two, limiting their control capabilities and diversity. Therefore, this paper presents MotionCtrl, a unified and flexible motion controller for video generation designed to effectively and independently control camera and object motion. The architecture and training strategy of MotionCtrl are carefully devised, taking into account the inherent properties of camera motion, object motion, and imperfect training data. Compared to previous methods, MotionCtrl offers three main advantages: 1) It effectively and independently controls camera motion and object motion, enabling more fine-grained motion control and facilitating flexible and diverse combinations of both types of motion. 2) Its motion conditions are determined by camera poses and trajectories, which are appearance-free and minimally impact the appearance or shape of objects in generated videos. 3) It is a relatively generalizable model that can adapt to a wide array of camera poses and trajectories once trained. Extensive qualitative and quantitative experiments have been conducted to demonstrate the superiority of MotionCtrl over existing methods.
PDF222December 15, 2024