让任意角色在任意世界中动起来 **核心功能:** - **跨角色适配**:支持动漫、游戏、真人等各类角色形象 - **场景无缝融合**:自动匹配不同世界观背景(奇幻/科幻/现代等) - **动态自然生成**:提供行走、奔跑、施法等20+基础动作模板 - **AI驱动优化**:通过深度学习自动补间关键帧,确保动作流畅度 **技术亮点:** 1. 多模态骨骼绑定系统(支持二足/四足/异形角色) 2. 实时物理模拟(布料/毛发动力学) 3. 环境光影自适应渲染 4. 一键式动作捕捉导入 **应用场景:** - 独立游戏角色动画制作 - 虚拟主播形象动态设计 - 跨媒介IP角色移植 - 教育领域虚拟教师开发 (通过拖拽上传角色立绘,选择目标世界模板,即可实时预览动态效果。支持导出GIF/MP4/精灵图序列等格式)
Animate Any Character in Any World
December 18, 2025
作者: Yitong Wang, Fangyun Wei, Hongyang Zhang, Bo Dai, Yan Lu
cs.AI
摘要
世界模型的最新进展显著提升了交互式环境模拟能力。现有方法主要分为两类:静态世界生成模型(构建无主动智能体的三维环境)和可控实体模型(允许单一实体在不可控环境中执行有限动作)。本研究提出的AniX框架,在保留静态世界生成真实感与结构基础优势的同时,将可控实体模型扩展至支持用户指定角色执行开放式动作。用户可提供三维高斯溅射场景与角色,通过自然语言指令引导角色完成从基础移动到以物体为中心的多样化交互行为,并自由探索环境。AniX通过条件自回归视频生成框架,合成具有时间一致性的视频片段,确保与原始场景和角色的视觉保真度。基于预训练视频生成器,我们的训练策略在保持动作与角色泛化能力的同时,显著提升了运动动力学表现。评估体系涵盖视觉质量、角色一致性、动作可控性及长时序连贯性等多维度指标。
English
Recent advances in world models have greatly enhanced interactive environment simulation. Existing methods mainly fall into two categories: (1) static world generation models, which construct 3D environments without active agents, and (2) controllable-entity models, which allow a single entity to perform limited actions in an otherwise uncontrollable environment. In this work, we introduce AniX, leveraging the realism and structural grounding of static world generation while extending controllable-entity models to support user-specified characters capable of performing open-ended actions. Users can provide a 3DGS scene and a character, then direct the character through natural language to perform diverse behaviors from basic locomotion to object-centric interactions while freely exploring the environment. AniX synthesizes temporally coherent video clips that preserve visual fidelity with the provided scene and character, formulated as a conditional autoregressive video generation problem. Built upon a pre-trained video generator, our training strategy significantly enhances motion dynamics while maintaining generalization across actions and characters. Our evaluation covers a broad range of aspects, including visual quality, character consistency, action controllability, and long-horizon coherence.