互動式生成影片技術綜述
A Survey of Interactive Generative Video
April 30, 2025
作者: Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Hao Chen, Xihui Liu
cs.AI
摘要
交互式生成视频(IGV)已成為一項關鍵技術,以應對各領域對高質量、互動視頻內容日益增長的需求。在本文中,我們將IGV定義為一種結合生成能力以產生多樣化高質量視頻內容,並具備互動功能,使用戶能夠通過控制信號和響應反饋進行參與的技術。我們調查了當前IGV應用的現狀,重點關注三大領域:1)遊戲,其中IGV實現了虛擬世界中的無限探索;2)具身人工智能,其中IGV作為物理感知的環境合成器,用於訓練智能體在多模態交互中與動態演變場景的互動;3)自動駕駛,其中IGV提供了閉環模擬能力,用於安全關鍵測試和驗證。為指導未來發展,我們提出了一個全面的框架,將理想的IGV系統分解為五個核心模塊:生成、控制、記憶、動態和智能。此外,我們系統地分析了實現理想IGV系統中各組件的技術挑戰和未來方向,例如實現實時生成、支持開放域控制、保持長期連貫性、模擬精確物理以及整合因果推理。我們相信,這一系統性分析將促進IGV領域的未來研究與發展,最終推動該技術向更為複雜和實用的應用邁進。
English
Interactive Generative Video (IGV) has emerged as a crucial technology in
response to the growing demand for high-quality, interactive video content
across various domains. In this paper, we define IGV as a technology that
combines generative capabilities to produce diverse high-quality video content
with interactive features that enable user engagement through control signals
and responsive feedback. We survey the current landscape of IGV applications,
focusing on three major domains: 1) gaming, where IGV enables infinite
exploration in virtual worlds; 2) embodied AI, where IGV serves as a
physics-aware environment synthesizer for training agents in multimodal
interaction with dynamically evolving scenes; and 3) autonomous driving, where
IGV provides closed-loop simulation capabilities for safety-critical testing
and validation. To guide future development, we propose a comprehensive
framework that decomposes an ideal IGV system into five essential modules:
Generation, Control, Memory, Dynamics, and Intelligence. Furthermore, we
systematically analyze the technical challenges and future directions in
realizing each component for an ideal IGV system, such as achieving real-time
generation, enabling open-domain control, maintaining long-term coherence,
simulating accurate physics, and integrating causal reasoning. We believe that
this systematic analysis will facilitate future research and development in the
field of IGV, ultimately advancing the technology toward more sophisticated and
practical applications.Summary
AI-Generated Summary